BACKGROUND OF THE INVENTION



FIELD OF THE INVENTION 


The invention relates generally to the fields of 
genetics and autoimmune disease and, more specifically, to
mutations linked to the NOD2/CARD15 gene and genetic methods 
for diagnosing clinical subtypes of Crohn's disease. 


BACKGROUND INFORMATION 



Inflammatory bowel disease (IBD) is the collective
term used to describe two gastrointestinal disorders of
unknown etiology: Crohn's disease (CD) and ulcerative
colitis (UC). The course and prognosis of IBD, which occurs
world-wide and is reported to afflict as many as two million
people, vary widely. Onset of IBD is predominantly in
young adulthood with diarrhea, abdominal pain, and fever the 
three most common presenting symptoms. The diarrhea may 
range from mild to severe, and anemia and weight loss are 
additional common signs of IBD. Of all patients with IBD, 
ten percent to fifteen percent will require surgery over a 
ten year period. In addition, patients with IBD are at 
increased risk for the development of intestinal cancer. 
Reports of an increasing occurrence of psychological 
problems, including anxiety and depression, are perhaps not 
surprising symptoms of what is often a debilitating disease 
that strikes people in the prime of life. 

Crohn's disease is a classification representing a 
number of heterogeneous disease subtypes that affect the 
gastrointestinal tract and produce similar symptoms. Both 
environmental and genetic factors likely contribute to the 
etiology of such disease subtypes. Patients with Crohn's 
disease can be classified, for example, into subtypes based
on the presence of fibrostenosing disease,
internal-perforating disease, perianal fistulizing disease or
ulcerative colitis-like disease according to previously
described criteria. The fibrostenosing disease subtype is
characterized by documented persistent intestinal
obstruction or intestinal resection for intestinal
obstruction. The extensive and often protracted clinical
testing required to diagnose Crohn's disease and disease
subtypes may delay optimal treatment and involves invasive 
procedures such as endoscopy. 

Identification of genetic markers which are
closely associated with a clinical subtype of Crohn's
disease would provide the basis for novel genetic tests and
eliminate or reduce the need for the battery of laboratory,
radiological, and endoscopic evaluations typically required
to determine disease subtype. The availability of methods
for diagnosing clinical subtypes of Crohn's disease would
represent a major clinical advance that would aid in the 
therapeutic management of Crohn's disease and would further 
lay the groundwork for the design of treatment modalities 
which are specific to a particular disease subtype. Such 
methods can reduce costs associated with treatment of 
unresponsive disease subtypes and eliminate the 
disappointment of those needlessly undergoing ineffective 
therapy. In particular, a reliable genetic test for the
fibrostenosing subtype of Crohn's disease would be highly
prized as a non-invasive method for the early diagnosis of
this disease subtype and would also be useful for predicting
susceptibility to the fibrostenosing subtype of Crohn's
disease in asymptomatic individuals, making prophylactic
therapy possible. The present invention satisfies this need
and provides related advantages as well.

SUMMARY OF THE INVENTION 

The present invention provides a method of
diagnosing or predicting susceptibility to a clinical
subtype of Crohn's disease characterized by fibrostenosing
disease by determining the presence or absence in an
individual of a fibrostenosis-predisposing allele linked to
a NOD2/CARD15 locus, where the presence of the
fibrostenosis-predisposing allele is diagnostic of or
predictive of susceptibility to the clinical subtype of
Crohn's disease characterized by fibrostenosing disease. In
a method of the invention, the clinical subtype of Crohn's
disease can be, for example, characterized by fibrostenosing
disease independent of small bowel involvement.

The invention also provides a method of optimizing
therapy in an individual by determining the presence or
absence in the individual of a fibrostenosis-predisposing
allele linked to a NOD2/CARD15 locus, diagnosing individuals
in which the fibrostenosis-predisposing allele is present as
having a fibrostenosing subtype of Crohn's disease, and
treating the individual having a fibrostenosing subtype of
Crohn's disease based on the diagnosis.

BRIEF DESCRIPTION OF THE DRAWINGS 



Figure 1 shows the NOD2/CARD15 locus intron and
exon structure with the location of SNP 8, SNP 12, SNP 13,
and JW1 as well as other markers.



Figure 2 shows the frequency of NOD2 variant
carriers having at least one of the three NOD2/CARD15 rare
variant alleles (a "2" allele at SNP 8, SNP 12 or SNP 13) in
Cohort 1 ("CD1"), Cohort 2 ("CD2") or a combination of
Cohort 1 and Cohort 2 ("Combined CD"). The striped bars
indicate Crohn's disease patients with fibrostenosing
disease and the solid bars indicate Crohn's disease patients
who did not have the fibrostenosing subtype of disease.

Figure 3 shows the frequency of fibrostenosing
complications in patients relative to the number of
NOD2/CARD15 rare variant alleles at SNP 8, SNP 12 or SNP 13.
Based on genotyping at SNP 8, SNP 12 and SNP 13, patients
were described as carrying 0, 1, or 2 rare variant alleles
(x axis, number of mutant NOD2 alleles). The left y axis
shows the frequency of fibrostenosing complications as
filled circles. The right y axis shows the odds ratio as a
filled diamond with 95% confidence intervals in parentheses.
The * indicates a p value of 0.008 and ** indicates a p value
of 0.004 compared with 0 alleles.
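
As a purely illustrative sketch, and not part of the disclosed
method, the grouping of patients by number of rare variant
alleles described for Figure 3 could be tallied as shown below.
The genotype layout (a mapping from marker name to a pair of
allele labels), the use of "1" for the common allele, and the
function names are assumptions introduced here for illustration;
only the "2" label for the rare variant comes from the text.

```python
# Illustrative sketch only. The data layout, the "1" label for the
# common allele, and all function names are assumptions.
RARE_ALLELE = "2"
MARKERS = ("SNP 8", "SNP 12", "SNP 13")


def count_rare_alleles(genotype):
    """Count "2" alleles across the three markers for one patient.

    `genotype` maps each marker name to a pair of allele labels,
    for example {"SNP 8": ("1", "2"), ...}.
    """
    return sum(
        allele == RARE_ALLELE
        for marker in MARKERS
        for allele in genotype[marker]
    )


def is_carrier(genotype):
    """A carrier has at least one rare variant allele at any marker."""
    return count_rare_alleles(genotype) >= 1


if __name__ == "__main__":
    # Hypothetical compound heterozygote: one "2" allele at each of two markers.
    patient = {
        "SNP 8": ("1", "2"),
        "SNP 12": ("1", "1"),
        "SNP 13": ("2", "1"),
    }
    print(count_rare_alleles(patient), is_carrier(patient))
```

In this sketch a homozygote at one marker and a compound
heterozygote across two markers both count as carrying two rare
variant alleles.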

Figure 4 shows a comparison of NOD2/CARD15 variant
allelic frequencies in patients with fibrostenosing disease
compared with perforating disease. Patients were separated
by the presence of fibrostenosing disease with perforating
complications ("Fib + perf") or fibrostenosing disease
without perforating complications ("Fib only") compared with
patients with perforating complications and without evidence
of fibrostenosis ("Perf only").

Figure 5 shows the nucleotide sequence of
NOD2/CARD15 surrounding SNP 8. The top strand is labeled as
SEQ ID NO:1 and the bottom strand is labeled as SEQ ID NO:2.
Nucleotide sequences which can be used as primers for PCR
amplification are indicated. In addition, the position of a
nucleotide sequence which can be used as a probe in an
allelic discrimination assay is indicated, in this figure,
by a box and lower case letters. The underlined nucleotide
indicates the position of the polymorphic site.

Figure 6 shows the nucleotide sequence of
NOD2/CARD15 surrounding SNP 12. The top strand is labeled
as SEQ ID NO:3 and the bottom strand is labeled as SEQ ID
NO:4. Nucleotide sequences which can be used as primers for
PCR amplification are indicated. In addition, the position
of a nucleotide sequence which can be used as a probe in an
allelic discrimination assay is indicated, in this figure,
by a box and lower case letters. The underlined nucleotide
indicates the position of the polymorphic site.

Figure 7 shows the nucleotide sequence of
NOD2/CARD15 surrounding SNP 13. The top strand is labeled
as SEQ ID NO:5 and the bottom strand is labeled as SEQ ID
NO:6. Nucleotide sequences which can be used as primers for
PCR amplification are indicated. In addition, the position
of a nucleotide sequence which can be used as a probe in an
allelic discrimination assay is indicated, in this figure,
by a box and lower case letters. The underlined nucleotide
indicates the position of the polymorphic site.

Figure 8 shows the nucleotide sequence of
NOD2/CARD15 surrounding SNP 5. The top strand is labeled as
SEQ ID NO:7 and the bottom strand is labeled as SEQ ID NO:8.
Nucleotide sequences which can be used as primers for PCR
amplification are indicated. In addition, the position of a
nucleotide sequence which can be used as a probe in an
allelic discrimination assay is indicated, in this figure,
by a box and lower case letters. The underlined nucleotide
indicates the position of the polymorphic site.

Figure 9 shows the nucleotide sequence of
NOD2/CARD15 surrounding the JW1 variant sequence. The top
strand is labeled as SEQ ID NO:9 and the bottom strand is
labeled as SEQ ID NO:10. Nucleotide sequences which can be
used as primers for PCR amplification are indicated. In
addition, the position of a nucleotide sequence which can be
used as a probe in an allelic discrimination assay is
indicated, in this figure, by a box and lower case letters.
The underlined nucleotide indicates the position of the
polymorphic site.






Figure 10 shows the nucleotide sequence of the 5'
untranslated region of NOD2/CARD15 in 12 individuals (SEQ ID
NOS:12-23) compared to the wild-type NOD2/CARD15 sequence
(SEQ ID NO:11). Areas of sequence identity are shaded. The
locations of two polymorphic sites, JW18 and JW17, are
indicated.

Figure 11 shows the nucleotide sequence of the 3'
untranslated region of NOD2/CARD15 in 12 individuals. Areas
of sequence identity are shaded. The locations of two
polymorphic sites, JW15 and JW16, are indicated.
Figure 11A shows the nucleotide sequence of the 3'
untranslated region of NOD2/CARD15 in 12 individuals (SEQ ID
NOS:25-36) compared to the wild-type NOD2/CARD15 sequence
(SEQ ID NO:24) and the location of JW16. Figure 11B shows
the nucleotide sequence of the 3' untranslated region of
NOD2/CARD15 in 12 individuals (SEQ ID NOS:56-67) compared to
the wild-type NOD2/CARD15 sequence (SEQ ID NO:55) and the
location of JW15.



DETAILED DESCRIPTION OF THE INVENTION 



The present invention is directed to the exciting
discovery of disease-predisposing alleles that are closely
associated with the fibrostenosing disease subtype of
Crohn's disease. These fibrostenosis-predisposing alleles
are linked to a NOD2/CARD15 locus as described further below
and can be used to diagnose or predict susceptibility to the
fibrostenosing disease subtype of Crohn's disease.



As disclosed herein, genotyping and other clinical
characterization approaches were used to identify a strong
association between disease-predisposing alleles and the
fibrostenosing disease subtype of Crohn's disease. In
particular, two cohorts of Crohn's disease patients were
assembled and clinically characterized (see Example I and
Table 1). These patients were also genotyped for three
single nucleotide polymorphisms (SNPs) in the NOD2/CARD15
gene, SNP 8, SNP 12, and SNP 13, which are polymorphic
markers associated with Crohn's disease. As disclosed
herein, univariate analysis indicated that a "2" allele at
SNP 8, SNP 12, or SNP 13 of the NOD2/CARD15 locus was
significantly associated with fibrostenosing disease in
Cohort 1 (p=0.049, see Table 5). In addition, a positive
association at a less stringent significance level was also
observed with small bowel involvement and younger age of
onset, and a negative association was observed with
ulcerative colitis-like disease in this cohort. With
respect to serologic markers, patients with the "2" allele
at SNP 13 were more likely to express anti-Saccharomyces
cerevisiae antibodies (ASCA) (p=0.053).

The results obtained with Cohort 1 were further
tested using Cohort 2 as disclosed herein in Example IV. As
with Cohort 1, Cohort 2 demonstrated a significant
association between a "2" allele at SNP 8, SNP 12, or SNP 13
of the NOD2/CARD15 locus and fibrostenosing disease
(p=0.002, see Table 6). Furthermore, the significance
between a "2" allele at SNP 8, SNP 12, or SNP 13 of the
NOD2/CARD15 locus and fibrostenosing disease increased when
the two cohorts were analyzed together (p=0.001, see Figure
2).

As further disclosed herein in Example V, a "2"
allele at SNP 8, SNP 12, or SNP 13 of the NOD2/CARD15 locus
was associated with fibrostenosing disease in both Jewish
and non-Jewish individuals. Approximately 46% of Crohn's
disease patients with fibrostenosing disease (Jewish
individuals 52% vs. non-Jewish individuals 42%) had at least
one of these rare variant alleles compared with only 23%
(Jewish individuals 21.6% vs. non-Jewish individuals 25%) of
Crohn's disease patients without fibrostenosing disease
(odds ratio, 2.8; 95% confidence interval, 1.56-5.18). Of
the three rare variant alleles, the "2" allele at SNP 13,
which is a frameshift mutation denoted "3020insC,"
demonstrated the greatest association with fibrostenosing
disease (47% vs. 17%, p=0.006 for cohorts combined). These
results indicate that fibrostenosis-predisposing alleles can
be linked to the NOD2/CARD15 locus.


As further disclosed herein in Example VI,
patients who were carriers of two fibrostenosis-predisposing
alleles in NOD2/CARD15 were significantly more likely to
have fibrostenosing disease as compared with patients who
were not carriers of NOD2/CARD15 mutations at SNP 8, SNP 12
or SNP 13 (85% vs. 43%; odds ratio 7.4; 95% confidence
interval 1.9-28.9, p=0.004). See Figure 3. Patients who
were carriers of a single NOD2/CARD15
fibrostenosis-predisposing allele were also significantly
more likely to have fibrostenosing disease when compared
with patients who were not carriers of any of the three
NOD2/CARD15 fibrostenosis-predisposing alleles assayed
(64% vs. 43%; odds ratio 2.37; 95% confidence interval
1.26-4.47; p=0.008). These results confirm that patients who
have a fibrostenosis-predisposing allele linked to a
NOD2/CARD15 locus can have the fibrostenosing subtype of
Crohn's disease and further indicate that Crohn's disease
patients with multiple fibrostenosis-predisposing alleles
(homozygous mutations or compound heterozygous mutations)
linked to NOD2/CARD15 have an increased risk of
fibrostenosing disease as compared to individuals carrying a
single fibrostenosis-predisposing allele.
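
For orientation only, the odds ratios quoted above can be
approximated from the rounded percentages. The following is a
minimal sketch under stated assumptions: the function names are
hypothetical, the Woolf (log-based) interval is one common
choice and is not stated in this disclosure to be the method
used, and the counts passed to the interval function are
invented placeholders because patient counts are not given in
this passage. From the rounded rates, 64% vs. 43% gives roughly
2.36 and 85% vs. 43% gives roughly 7.5, close to the reported
2.37 and 7.4; the small differences are consistent with
rounding of the percentages.

```python
import math


def odds_ratio_from_rates(rate_carriers, rate_noncarriers):
    """Odds ratio from the fraction with fibrostenosing disease in each group."""
    odds_carriers = rate_carriers / (1.0 - rate_carriers)
    odds_noncarriers = rate_noncarriers / (1.0 - rate_noncarriers)
    return odds_carriers / odds_noncarriers


def odds_ratio_with_woolf_ci(a, b, c, d, z=1.96):
    """Odds ratio and approximate 95% interval from a 2x2 table.

    a: carriers with disease        b: carriers without disease
    c: non-carriers with disease    d: non-carriers without disease
    The Woolf interval is an assumption, not taken from the disclosure.
    """
    estimate = (a * d) / (b * c)
    se_log = math.sqrt(1.0 / a + 1.0 / b + 1.0 / c + 1.0 / d)
    lower = math.exp(math.log(estimate) - z * se_log)
    upper = math.exp(math.log(estimate) + z * se_log)
    return estimate, lower, upper


if __name__ == "__main__":
    # Rounded percentages quoted in the text.
    print(odds_ratio_from_rates(0.64, 0.43))  # one allele vs. none
    print(odds_ratio_from_rates(0.85, 0.43))  # two alleles vs. none
    # Placeholder counts, NOT study data, to show the interval calculation.
    print(odds_ratio_with_woolf_ci(20, 10, 15, 25))
```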


Fibrostenosing and perforating disease can occur
together in the same patient. Patients with fibrostenosing
disease can be characterized, for example, as i) having only
fibrostenosing disease or ii) having both fibrostenosing and
perforating disease. As disclosed herein in Example VII,
the percentage of patients having only fibrostenosing
disease that carried a "2" allele at SNP 8, SNP 12, or SNP
13 of the NOD2/CARD15 locus was 48.3%, which was similar to
that seen in patients with both fibrostenosing and
perforating complications (46.0%; p=0.8). As seen in Figure
4, when patients with fibrostenosing disease were compared
with those patients described as having perforating disease
only (perianal or internal), the frequency of the "2" allele
at SNP 8, SNP 12 or SNP 13 of the NOD2/CARD15 locus in
patients with fibrostenosing disease (with or without
perforating complications) was significantly greater than
that seen in patients with only perforating complications
(46.6% versus 18.6%; p=0.002).

As further disclosed herein in Example VIII,
multivariate analysis was performed to investigate the
association of a "2" allele at SNP 8, SNP 12, or SNP 13 of
the NOD2/CARD15 locus with clinical phenotypes. For
multivariate analysis, all variables with at least
borderline significance (p<0.1) in either cohort were
tested simultaneously for their association with a "2"
allele at SNP 8, SNP 12, or SNP 13 of the NOD2/CARD15 locus
using logistic regression. As shown in Table 7, the
clinical phenotype of fibrostenosing disease was
significantly associated (p<0.05) with these rare alleles
at the NOD2/CARD15 locus (odds ratio 2.8; 95% confidence
interval, 1.3-6.0). These results confirm that
fibrostenosing disease is independently associated with a
"2" allele at SNP 8, SNP 12, or SNP 13 of the NOD2/CARD15
locus.

Because fibrostenosing disease is more likely to
occur in patients with small-bowel involvement, patients
were stratified on the basis of small-bowel involvement to
analyze whether the association between fibrostenosing
disease and NOD2/CARD15 variant alleles was a primary
association (see Example IX). Among patients with
small-bowel involvement, 26.4% of patients who did not have
fibrostenosing disease (n=53) had a "2" allele at SNP 8, SNP
12, or SNP 13, whereas 46.1% of patients who had
fibrostenosing disease (n=102) had a "2" allele at SNP 8,
SNP 12, or SNP 13 (p=0.017). A similar trend was observed
among patients without small-bowel involvement (p=0.05), and
the combined analysis conditioning on small-bowel
involvement yielded a significance level of 0.009. After
controlling for fibrostenosing disease, small-bowel
involvement was not associated with a "2" allele at SNP 8,
SNP 12, or SNP 13 of the NOD2/CARD15 locus (p=0.63). This
result agrees with the results from logistic regression
analysis (see Example VIII) and indicates that the
association between fibrostenosing disease and a "2" allele
at SNP 8, SNP 12, or SNP 13 of the NOD2/CARD15 locus is
independent of small-bowel involvement. These results
further indicate that the observed small-bowel association
with a "2" allele at SNP 8, SNP 12, or SNP 13 of the
NOD2/CARD15 locus is secondary to the presence of
fibrostenosing disease.
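
As an illustrative check only, the comparison among patients
with small-bowel involvement can be recomputed from the reported
percentages and group sizes. The sketch below rests on several
assumptions: the carrier counts are back-calculated here from
the rounded percentages (26.4% of 53 is about 14 and 46.1% of
102 is about 47), the Pearson chi-square test without continuity
correction is a choice made for illustration since this passage
does not name the test used, and the function name is
hypothetical. Under these assumptions the statistic is about
5.65 with a p value near 0.017.

```python
import math


def chi_square_2x2(a, b, c, d):
    """Pearson chi-square (1 degree of freedom, no continuity correction).

    a, b: carriers and non-carriers WITH fibrostenosing disease
    c, d: carriers and non-carriers WITHOUT fibrostenosing disease
    Returns (statistic, p_value).
    """
    n = a + b + c + d
    statistic = n * (a * d - b * c) ** 2 / (
        (a + b) * (c + d) * (a + c) * (b + d)
    )
    # Upper tail of a chi-square distribution with 1 degree of freedom.
    p_value = math.erfc(math.sqrt(statistic / 2.0))
    return statistic, p_value


if __name__ == "__main__":
    # Counts back-calculated from rounded percentages (an assumption).
    n_fib, n_no_fib = 102, 53
    carriers_fib = round(0.461 * n_fib)        # 47
    carriers_no_fib = round(0.264 * n_no_fib)  # 14
    stat, p = chi_square_2x2(
        carriers_fib, n_fib - carriers_fib,
        carriers_no_fib, n_no_fib - carriers_no_fib,
    )
    print(f"chi-square = {stat:.2f}, p = {p:.3f}")
```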

Based on these discoveries, the present invention
provides a method of diagnosing or predicting susceptibility
to a clinical subtype of Crohn's disease characterized by
fibrostenosing disease by determining the presence or
absence in an individual of a fibrostenosis-predisposing
allele linked to a NOD2/CARD15 locus, where the presence of
the fibrostenosis-predisposing allele is diagnostic of or
predictive of susceptibility to the clinical subtype of
Crohn's disease characterized by fibrostenosing disease.
The methods of the invention can be advantageous in that
they are noninvasive and can be conveniently practiced, for
example, with a blood sample from an individual. The
methods of the invention can be used to quickly, easily and
reliably diagnose or predict susceptibility to a clinical
subtype of Crohn's disease characterized by fibrostenosing
disease as described further herein below.

In one embodiment, a method of the invention is
practiced with a fibrostenosis-predisposing allele located
within the NOD2/CARD15 locus. In another embodiment,
NF-kappa B activation by a NOD2/CARD15 polypeptide encoded
by the fibrostenosis-predisposing allele is reduced as
compared to NF-kappa B activation by a wild-type NOD2/CARD15
polypeptide. In a further embodiment, a method of the
invention is practiced with a fibrostenosis-predisposing
allele located in a coding region of the NOD2/CARD15 locus,
for example, within a region encoding residues 744 to 1020
of NOD2/CARD15. In still a further embodiment, a method of
the invention is practiced with a fibrostenosis-predisposing
allele which is a "2" allele at SNP 8, SNP 12, or SNP 13.
In yet a further embodiment, a method of the invention is
practiced with a fibrostenosis-predisposing allele which is
a "2" allele at SNP 13. In another embodiment, a method of
the invention is practiced with a fibrostenosis-predisposing
allele which is located in a non-coding region of the
NOD2/CARD15 locus. Such an allele can be, without
limitation, a JW1, JW15, or JW16 variant allele. In another
embodiment, a method of the invention is practiced with a
fibrostenosis-predisposing allele which is located in a
promoter region of the NOD2/CARD15 locus. Useful
fibrostenosis-predisposing alleles in the promoter region of
NOD2/CARD15 include, but are not limited to, JW17 and JW18
variant alleles.



The present invention relates to genetic markers
which localize to the IBD1 locus on chromosome 16.
Utilizing genome wide scan linkage strategies, the IBD1
locus was mapped to the proximal region of the long arm of
chromosome 16 (16q12) in the Caucasian population (Hugot et
al., Nature 379:821-823 (1996)). This finding has been
replicated in many studies, including an international
collaborative study reporting a high multipoint linkage
score (MLS) for a complex disease (MLS = 5.7 at marker
D16S411 in 16q12). See Cho et al., Inflamm. Bowel Dis.
3:186-190 (1997), Akolkar et al., Am. J. Gastroenterol.
96:1127-1132 (2001), Ohmen et al., Hum. Mol. Genet.
5:1679-1683 (1996), Parkes et al., Lancet 348:1588 (1996),
Cavanaugh et al., Ann. Hum. Genet. (1998), Brant et al.,
Gastroenterology 115:1056-1061 (1998), Curran et al.,
Gastroenterology 115:1066-1071 (1998), Hampe et al., Am. J.
Hum. Genet. 64:808-816 (1999), and Annese et al., Eur. J.
Hum. Genet. 7:567-573 (1999). NOD2/CARD15 within the IBD1
locus was simultaneously identified by a positional-cloning
strategy (Hugot et al., Nature 411:599-603 (2001)) and a
positional candidate gene strategy (Ogura et al., Nature
411:603-606 (2001), Hampe et al., Lancet 357:1925-1928
(2001)). The encoded NOD2/CARD15 protein contains
amino-terminal caspase recruitment domains (CARDs), which
can activate NF-kappa B (NF-kB), and several
carboxy-terminal leucine-rich repeat domains (Ogura et al.,
J. Biol. Chem. 276:4812-4818 (2001)). Figure 1 shows an
illustration of the NOD2/CARD15 locus.

The sequence of the human NOD2/CARD15 gene can be 
found in GenBank as accession number NM_022162, which is 
incorporated by reference herein. In addition, the complete 
sequence of human chromosome 16 clone RP11-327F22, which 
includes NOD2/CARD15, can be found in GenBank as accession
number AC007728, which is incorporated by reference herein.
Furthermore, the sequence of NOD2/CARD15 from other species
can be found in the GenBank database. 



Variations at three single nucleotide
polymorphisms in the coding region of NOD2/CARD15 have been
previously described. These three SNPs, designated SNP 8,
SNP 12 and SNP 13, are located in the carboxy-terminal
region of the NOD2/CARD15 gene (Hugot et al., supra, 2001).

The invention relies, in part, on determining the
presence or absence in an individual of a
fibrostenosis-predisposing allele linked to a NOD2/CARD15
locus. The term "fibrostenosis-predisposing allele," as used
herein, means a stably heritable molecular variation that is
linked to the NOD2/CARD15 locus and that tends to be
inherited together with the clinical subtype of Crohn's
disease characterized by fibrostenosing disease more often
than would be expected according to traditional Mendelian
genetics. A fibrostenosis-predisposing allele useful in the
invention can be, without limitation, a single nucleotide
polymorphism (SNP), microsatellite (ms), variable number
tandem repeat (VNTR) polymorphism, or a substitution,
insertion or deletion of one or more nucleotides. One
skilled in the art understands that a
fibrostenosis-predisposing allele also can be a molecular
variation such as abnormal methylation or other modification
that does not produce a difference in the primary nucleotide
sequence of the fibrostenosis-predisposing allele as
compared to the wild type allele.

The term "linked to," as used herein in reference
to a fibrostenosis-predisposing allele and a NOD2/CARD15
locus, means that the fibrostenosis-predisposing allele and
the NOD2/CARD15 locus are inherited together more often than
would be expected according to traditional Mendelian
genetics. It is understood that an allele and locus are
linked when there is less than 50% recombination between the
two sites. A fibrostenosis-predisposing allele is generally
separated from a NOD2/CARD15 locus by at most 50 centiMorgan
(cM). As non-limiting examples, a
fibrostenosis-predisposing allele can be within 50
centiMorgan (cM), 40 cM, 30 cM, 20 cM, 10 cM, 5 cM, or 1 cM
of the NOD2/CARD15 locus. The distance between a linked
fibrostenosis-predisposing allele and a NOD2/CARD15 locus
can also be, for example, 50,000,000 base pairs (bps),
40,000,000 bps, 30,000,000 bps, 20,000,000 bps, 10,000,000
bps, 5,000,000 bps, 1,000,000 bps, 500,000 bps, 100,000 bps,
50,000 bps, 10,000 bps, 1,000 bps or 100 bps.
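
By way of illustration only, the "less than 50% recombination"
criterion can be expressed as a simple check. The sketch below
makes assumptions that are not part of this disclosure: the
Haldane map function is used to convert a map distance in
centiMorgans into an expected recombination fraction (no map
function is specified herein), and the function names are
hypothetical. Under the Haldane assumption, a distance of 50 cM
corresponds to a recombination fraction of about 0.32, which is
below the 0.5 expected for unlinked sites.

```python
import math


def haldane_recombination_fraction(distance_cm):
    """Expected recombination fraction for a map distance in cM.

    Uses the Haldane map function r = 0.5 * (1 - exp(-2d)), with d in
    Morgans. The choice of map function is an assumption.
    """
    distance_morgans = distance_cm / 100.0
    return 0.5 * (1.0 - math.exp(-2.0 * distance_morgans))


def is_linked(recombination_fraction):
    """Two sites are treated as linked when recombination is below 50%."""
    return recombination_fraction < 0.5


if __name__ == "__main__":
    for cm in (1, 5, 10, 20, 30, 40, 50):
        r = haldane_recombination_fraction(cm)
        print(f"{cm:>2} cM -> r = {r:.3f}, linked = {is_linked(r)}")
```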

The methods of the invention are useful for
diagnosing or predicting susceptibility to a clinical
subtype of Crohn's disease (regional enteritis), which is a
disease of chronic inflammation that can involve any part of
the gastrointestinal tract. Commonly the distal portion of
the small intestine (ileum) and cecum are affected in
Crohn's disease. In other cases, the disease is confined to
the small intestine, colon or anorectal region. Crohn's
disease occasionally involves the duodenum and stomach, and
more rarely the esophagus and oral cavity.

The variable clinical manifestations of Crohn's
disease are, in part, a result of the varying anatomic
localization of the disease. The most frequent symptoms of
Crohn's disease are abdominal pain, diarrhea and recurrent
fever. Crohn's disease is commonly associated with
intestinal obstruction or fistula, which is an abnormal
passage, for example, between diseased loops of bowel.
Crohn's disease also can include extra-intestinal
complications such as inflammation of the eye, joints and
skin; liver disease; kidney stones or amyloidosis; and is
associated with an increased risk of intestinal cancer.

Several features are characteristic of the
pathology of Crohn's disease. The inflammation associated
with Crohn's disease, known as transmural inflammation,
involves all layers of the bowel wall. Thickening and
edema, for example, typically appear throughout the bowel
wall, with fibrosis also present in long-standing disease.
The inflammation characteristic of Crohn's disease also is
discontinuous, with segments of inflamed tissue, known as
"skip lesions," separated by apparently normal intestine.
Furthermore, linear ulcerations, edema, and inflammation of
the intervening tissue lead to a "cobblestone" appearance of
the intestinal mucosa, which is distinctive of Crohn's
disease.

A hallmark of Crohn's disease is the presence of
discrete aggregations of inflammatory cells, known as
granulomas, which are generally found in the submucosa.
About half of Crohn's disease cases display the typical
discrete granulomas, while others show a diffuse
granulomatous reaction or nonspecific transmural
inflammation. As a result, the presence of discrete
granulomas is indicative of Crohn's disease, although the
absence of granulomas also is consistent with the disease.
Thus, transmural or discontinuous inflammation, rather than
the presence of granulomas, is a preferred diagnostic
indicator of Crohn's disease (Rubin and Farber, Pathology
(Second Edition) Philadelphia: J.B. Lippincott Company
(1994)).

Crohn's disease is a classification representing a
number of heterogeneous disease subtypes that affect the
gastrointestinal tract and may produce similar symptoms. As
non-limiting examples, patients with Crohn's disease can be
characterized as having subtypes characterized by
fibrostenosing disease, internal-perforating disease,
perianal fistulizing disease or ulcerative colitis (UC)-like
disease based on previously described criteria (Gasche et
al., Inflammatory Bowel Diseases 6:8-15 (2000); Vasiliauskas
et al., Gut 47:487-496 (2000); Vasiliauskas et al.,
Gastroenterology 110:1810-1819 (1996); and Greenstein et
al., Gut 29:588-592 (1988)).

According to well-established criteria,
fibrostenosing disease is defined by documented persistent
intestinal obstruction or an intestinal resection for an
intestinal obstruction. Radiographic, endoscopic, surgical
or histopathological documentation can be used to confirm a
diagnosis of the fibrostenosing subtype of Crohn's disease.
The fibrostenosing subtype of Crohn's disease can be
accompanied by other symptoms such as perforations,
abscesses or fistulae. In addition, the fibrostenosing
subtype of Crohn's disease can be characterized by
persistent symptoms of intestinal blockage such as nausea,
vomiting, abdominal distention and inability to eat solid
food. Intestinal X-rays of patients with the fibrostenosing
subtype of Crohn's disease can show, for example, distention
of the bowel before the point of blockage.

Additional subtypes of Crohn's disease can be
identified using defined clinical criteria. For example,
internal perforating disease can be defined as current or
previous evidence of entero-enteric or entero-vesicular
fistulae, intraabdominal abscesses, or small bowel
perforation. Perianal perforating disease can be defined by
current or previous evidence of either perianal fistulae or
abscesses or rectovaginal fistula. UC-like disease can be
defined by current or previous evidence of left-sided
colonic involvement, symptoms of bleeding or urgency, and
crypt abscesses on colonic biopsies as previously described.
Disease location can be classified as small bowel, colon, or
both based on one or more endoscopic, radiologic or
pathologic studies.

The fibrostenosing subtype of Crohn's disease can
occur in patients having disease with small-bowel
involvement. As disclosed herein, after controlling for
fibrostenosing disease, small-bowel involvement was not
associated with a "2" allele at SNP 8, SNP 12, or SNP 13 of
the NOD2/CARD15 locus (p=0.63; see Example IX). This result
agrees with the results from logistic regression analysis
disclosed in Example VIII and indicates that the association
between fibrostenosing disease and a "2" allele at SNP 8,
SNP 12, or SNP 13 of the NOD2/CARD15 locus is independent of
small-bowel involvement. Based on these findings, the
present invention provides a method of diagnosing or
predicting susceptibility to a clinical subtype of Crohn's
disease characterized by fibrostenosing disease independent
of small bowel involvement by determining the presence or
absence in an individual of a fibrostenosis-predisposing
allele linked to a NOD2/CARD15 locus, where the presence of
the fibrostenosis-predisposing allele is diagnostic of or
predictive of susceptibility to the clinical subtype of
Crohn's disease characterized by fibrostenosing disease
independent of small bowel involvement.

The diagnostic methods of the invention are
practiced in an individual. As used herein, the term
"individual" means an animal, such as a human or other
mammal, capable of having the fibrostenosing subtype of
Crohn's disease. An individual can have one or more
symptoms of Crohn's disease or the fibrostenosing subtype of
Crohn's disease or can be asymptomatic. The methods of the
invention can be useful, for example, for diagnosing the
fibrostenosing subtype of Crohn's disease in an individual
with one or more symptoms, or for predicting susceptibility
to the fibrostenosing subtype of Crohn's disease in an
asymptomatic individual such as an individual at increased
risk for having the fibrostenosing subtype of Crohn's
disease. In one embodiment, the methods of the invention
are used to determine the presence or absence of the
fibrostenosing subtype of Crohn's disease in an individual
known to have Crohn's disease.

The methods of the invention can be useful, for
example, to diagnose or predict susceptibility to the
fibrostenosing subtype of Crohn's disease in an Ashkenazi
Jewish individual. Crohn's disease is significantly more
common (2 to 8 fold higher) in Ashkenazi Jews than in
non-Jewish Caucasians (Brant et al., Gastroenterol.
115:1056-1061 (1998)). Furthermore, among persons of Jewish
ethnicity, American or European Ashkenazi Jews have a 2 to 4
fold increased risk of having this inflammatory bowel
disease compared with Sephardic or Oriental Jews (Yang and
Rotter in Kirschner and Shorter (Eds.), Inflammatory Bowel
Disease Baltimore: Williams and Wilkins, p. 301-331 (1995);
Rozen et al., Gastroenterol. 76:25-30 (1979)). The empiric
risk of Crohn's disease for a first degree relative of a
proband with Crohn's disease is 7.8% for Jews compared with
5.2% for non-Jews (p=0.005; Yang et al., Gut 34:517-524
(1993)). Thus, the Jewish population, and especially the
Ashkenazi Jewish population, represents a group at increased
risk for Crohn's disease and autoimmune diseases of related
etiology.

The methods of the invention rely on
fibrostenosis-predisposing alleles linked to the NOD2/CARD15
locus. NOD2/CARD15 has structural homology with both
apoptosis regulators Apaf-1/CED-4 and a class of plant
disease resistant gene products (Ogura et al., J. Biol.
Chem. 276:4812-4818 (2001)). Similar to plant disease
resistant gene products, NOD2/CARD15 has an amino-terminal
effector domain, a nucleotide-binding domain and leucine
rich repeats (LRRs). Wild-type NOD2/CARD15 activates
nuclear factor NF-kappa B, making it responsive to bacterial
lipopolysaccharides (LPS; Ogura et al., J. Biol. Chem.
276:4812-4818 (2001); Inohara et al., J. Biol. Chem.
276:2551-2554 (2001)). NOD2/CARD15 can function as an
intracellular receptor for LPS, with the leucine rich
repeats required for responsiveness. Like NOD2/CARD15,
cytosolic plant disease resistant polypeptides have
carboxy-terminal leucine rich repeats that are important for
the recognition of pathogen components and induction of
pathogen-specific responses (Parniske et al., Cell
91:821-832 (1997); Ellis et al., Plant Cell 11:495-506
(1999); Dixon et al., Proc. Natl. Acad. Sci. USA 97:8807-
8814 (2000)).

In one embodiment, the fibrostenosis-predisposing
allele is located within the NOD2/CARD15 locus, a schematic
of which is shown in Figure 1. The NOD2/CARD15 locus
includes coding regions of the NOD2/CARD15 gene as well as
non-coding regions such as introns and 5' and 3'
untranslated regions. One skilled in the art understands
that the NOD2/CARD15 locus can include, for example,
promoter regions 5' of the gene, enhancer regions 5' or 3'
of the gene or in intronic sequences, and mRNA stability
regions 3' of the gene.

In another embodiment, the
fibrostenosis-predisposing allele is located in a coding
region of the NOD2/CARD15 locus, for example within a region
encoding residues 744 to 1020 of NOD2/CARD15. Residues 744
to 1020 of the NOD2/CARD15 polypeptide contain several
leucine-rich repeats in the carboxy-terminal portion of the
NOD2/CARD15 polypeptide. Fibrostenosis-predisposing alleles
located in a region encoding residues 744 to 1020 of
NOD2/CARD15 include, without limitation, SNP 12 and SNP 13.
A fibrostenosis-predisposing allele useful in the invention
also can be an allele which encodes a NOD2/CARD15
polypeptide with reduced NF-kappa B activation as compared
to NF-kappa B activation by a wild-type NOD2/CARD15
polypeptide. As an example, a rare variant allele at SNP 13
results in a truncated NOD2/CARD15 polypeptide which has
reduced ability to induce NF-kappa B in response to LPS
stimulation (Ogura et al., Nature 411:603-606 (2001)).

A fibrostenosis-predisposing allele useful in the
invention can be, for example, a "2" allele at SNP 8, SNP 12
or SNP 13. SNP 8, SNP 12, and SNP 13 are located within the
coding region of NOD2/CARD15 as shown in Figure 1. In one
embodiment, a method of the invention is practiced with a
fibrostenosis-predisposing allele which is the SNP 8 "2"
allele. As used herein, the term "SNP 8" means a single
nucleotide polymorphism within exon 4 in the NOD2/CARD15
gene, which occurs within a triplet encoding amino acid 702
of the NOD2/CARD15 protein. The "1" allele, in which
cytosine (c) resides at position 138,991 of the AC007728
sequence, is the common or "wild-type" SNP 8 allele and
occurs within a triplet encoding an arginine at amino
acid 702. The "2" allele of SNP 8, in which thymine (t)
resides at position 138,991 of the AC007728 sequence, is a
rare variant that results in an arginine (R) to tryptophan
(W) substitution at amino acid 702 of the NOD2/CARD15
protein. Accordingly, the rare "2" allele at SNP 8 is
denoted "R702W" or "702W" and can also be denoted "R675W"
based on the earlier numbering system of Hugot et al.,
supra, 2001. The NCBI SNP ID number for SNP 8 is rs2066844,
which is incorporated herein by reference. As disclosed
herein and described further below, the presence of allele
"1" or "2" at SNP 8, or another SNP described below, can be
conveniently detected, for example, by allelic
discrimination assays or sequence analysis.

A method of the invention also can be practiced
with a fibrostenosis-predisposing allele which is the SNP 12
"2" allele. As used herein, the term "SNP 12" means a
single nucleotide polymorphism within exon 8 in the
NOD2/CARD15 gene, which occurs within a triplet encoding
amino acid 908 of the NOD2/CARD15 protein. The "1" allele,
in which guanine (g) resides at position 128,377 of the
AC007728 sequence, is the common or "wild-type" SNP 12
allele and occurs within a triplet encoding glycine at amino
acid 908. The "2" allele of SNP 12, in which cytosine (c)
resides at position 128,377 of the AC007728 sequence, is a
rare variant that results in a glycine (G) to arginine (R)
substitution at amino acid 908 of the NOD2/CARD15 protein.
This rare "2" allele at SNP 12 is denoted "G908R" or "908R"
and can also be denoted "G881R" based on the earlier
numbering system of Hugot et al., supra, 2001. SNP 12 is
located within the leucine rich repeat region of the
NOD2/CARD15 gene. The NCBI SNP ID number for SNP 12 is
rs2066845, which is incorporated herein by reference.
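
The SNP 8 and SNP 12 definitions above each pair a single
base change with an amino acid change. As an illustrative
sketch only, the following Python lists which codons of the
standard genetic code are consistent with each pairing. It
assumes that the stated base change is read on the coding
strand; the function name candidate_codons is hypothetical
and is not part of the disclosure.

```python
from itertools import product

BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
# Standard genetic code for DNA codons, enumerated in TCAG order.
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def candidate_codons(ref_base, alt_base, ref_aa, alt_aa):
    """Return (reference codon, variant codon) pairs in which one
    ref_base -> alt_base substitution turns ref_aa into alt_aa."""
    hits = []
    for codon, aa in CODON_TABLE.items():
        if aa != ref_aa:
            continue
        for i, base in enumerate(codon):
            if base != ref_base:
                continue
            variant = codon[:i] + alt_base + codon[i + 1:]
            if CODON_TABLE[variant] == alt_aa:
                hits.append((codon, variant))
    return hits


# SNP 8: cytosine -> thymine, arginine (R) -> tryptophan (W)
snp8 = candidate_codons("C", "T", "R", "W")
# SNP 12: guanine -> cytosine, glycine (G) -> arginine (R)
snp12 = candidate_codons("G", "C", "G", "R")
print(snp8)
print(snp12)
```

Under these assumptions only one arginine codon fits SNP 8,
whereas any glycine codon changed at its first position fits
SNP 12.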

A method of the invention also can be practiced
with a fibrostenosis-predisposing allele which is the SNP 13
"2" allele. This allele is an insertion of a single
nucleotide that results in a frame shift in the tenth
leucine-rich repeat of the NOD2/CARD15 protein and is
followed by a premature stop codon. The resulting
truncation of the NOD2/CARD15 protein appears to prevent
activation of NF-kappa B in response to bacterial
lipopolysaccharides (Ogura et al., supra, 2001). As used
herein, the term "SNP 13" means a single nucleotide
polymorphism within exon 11 in the NOD2/CARD15 gene, which
occurs in a triplet encoding amino acid 1007 of the
NOD2/CARD15 protein. The "2" allele of SNP 13, in which a
cytosine has been added at position 121,139 of the AC007728
sequence, is a rare variant resulting in a frame shift
mutation at amino acid 1007. Accordingly, the rare "2"
allele at SNP 13 is denoted "1007fs" and can also be denoted
"3020insC," or "980fs" based on the earlier numbering system
of Hugot et al., supra, 2001. The NCBI SNP ID number for
SNP 13 is rs2066847, which is incorporated herein by
reference.
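
To illustrate how a single inserted cytosine can shift the
reading frame and bring a stop codon into frame, the sketch
below translates a short toy sequence before and after the
insertion. The 18-nucleotide sequence and the insertion index
are hypothetical and chosen only for illustration; they are
not taken from AC007728 or from the disclosure.

```python
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
# Standard genetic code for DNA codons, enumerated in TCAG order.
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna):
    """Translate from the first base, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def insert_base(dna, index, base):
    """Return dna with base inserted before position index (0-based)."""
    return dna[:index] + base + dna[index:]


toy_wild_type = "ATGGCCCTTGAAAGGAAT"               # hypothetical toy sequence
toy_variant = insert_base(toy_wild_type, 6, "C")  # single-cytosine insertion

print(translate(toy_wild_type))
print(translate(toy_variant))
```

In this toy case the insertion changes the third codon and
places a stop codon immediately after it, so the translated
product is truncated.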

One skilled in the art recognizes that a
particular polymorphic allele can be conveniently defined,
for example, in comparison to a Centre d'Etude du
Polymorphisme Humain (CEPH) reference individual such as the
individual designated 1347-02 (Dib et al., Nature
380:152-154 (1996)), using commercially available reference
DNA obtained, for example, from PE Biosystems (Foster City,
CA). In addition, specific information on SNPs can be
obtained from the dbSNP of the National Center for
Biotechnology Information (NCBI).

A fibrostenosis-predisposing allele also can be
located in a non-coding region of the NOD2/CARD15 locus.
Non-coding regions include, for example, intron sequences as
well as 5' and 3' untranslated sequences. Examples of a
fibrostenosis-predisposing allele in an intron sequence of
the NOD2/CARD15 gene include, but are not limited to, the
JW1 variant allele, which is described below. Examples of
fibrostenosis-predisposing alleles located in the 3'
untranslated region of the NOD2/CARD15 gene include, without
limitation, JW15 and JW16 variant alleles, which are
described below. Examples of fibrostenosis-predisposing
alleles located in the 5' untranslated region of the
NOD2/CARD15 gene include, without limitation, the JW17 and
JW18 variant alleles, which are described below. In one
embodiment, a method of the invention is practiced with a
fibrostenosis-predisposing allele located in a non-coding
region of the NOD2/CARD15 locus which is a JW1, JW15 or JW16
variant allele. In another embodiment, a method of the
invention is practiced with a fibrostenosis-predisposing
allele located in a promoter region of the NOD2/CARD15 locus
which is a JW17 or JW18 variant allele.

As used herein, the term "JW1 variant allele"
means a genetic variation at nucleotide 158 of intervening
sequence 8 (intron 8) of a NOD2/CARD15 gene. In relation to
the AC007728 sequence, the JW1 variant is located at
position 128,143. The genetic variation at nucleotide 158
of intron 8 can be, but is not limited to, a single
nucleotide substitution, multiple nucleotide substitutions,
or a deletion or insertion of one or more nucleotides. The
wild type sequence of intron 8 has a cytosine at
position 158; as non-limiting examples, a JW1 variant allele
can have a cytosine (C) to adenine (A), cytosine to guanine
(G), or cytosine to thymine (T) substitution at nucleotide
158 of intron 8. In one embodiment, the JW1 variant allele
is a change from a cytosine to a thymine at nucleotide 158
of NOD2/CARD15 intron 8.

As used herein, the term "JW15 variant allele"
means a genetic variation in the 3' untranslated region of
NOD2/CARD15 at nucleotide position 118,790 of the AC007728
sequence. The genetic variation at nucleotide 118,790 can
be, but is not limited to, a single nucleotide substitution,
multiple nucleotide substitutions, or a deletion or
insertion of one or more nucleotides. The wild type
sequence has an adenine (A) at position 118,790; as
non-limiting examples, a JW15 variant allele can have an
adenine (A) to cytosine (C), adenine to guanine (G), or
adenine to thymine (T) substitution at nucleotide 118,790.
In one embodiment, the JW15 variant allele is a change from
an adenine to a cytosine at nucleotide 118,790.

As used herein, the term "JW16 variant allele"
means a genetic variation in the 3' untranslated region of
NOD2/CARD15 at nucleotide position 118,031 of the AC007728
sequence. The genetic variation at nucleotide 118,031 can
be, but is not limited to, a single nucleotide substitution,
multiple nucleotide substitutions, or a deletion or
insertion of one or more nucleotides. The wild type
sequence has a guanine (G) at position 118,031; as
non-limiting examples, a JW16 variant allele can have a
guanine (G) to cytosine (C), guanine to adenine (A), or
guanine to thymine (T) substitution at nucleotide 118,031.
In one embodiment, the JW16 variant allele is a change from
a guanine to an adenine at nucleotide 118,031.

As used herein, the term "JW17 variant allele"
means a genetic variation in the 5' untranslated region of
NOD2/CARD15 at nucleotide position 154,688 of the AC007728
sequence. The genetic variation at nucleotide 154,688 can
be, but is not limited to, a single nucleotide substitution,
multiple nucleotide substitutions, or a deletion or
insertion of one or more nucleotides. The wild type
sequence has a cytosine (C) at position 154,688; as
non-limiting examples, a JW17 variant allele can have a
cytosine (C) to guanine (G), cytosine to adenine (A), or
cytosine to thymine (T) substitution at nucleotide 154,688.
In one embodiment, the JW17 variant allele is a change from
a cytosine to a thymine at nucleotide 154,688.

As used herein, the term "JW18 variant allele"
means a genetic variation in the 5' untranslated region of
NOD2/CARD15 at nucleotide position 154,471 of the AC007728
sequence. The genetic variation at nucleotide 154,471 can
be, but is not limited to, a single nucleotide substitution,
multiple nucleotide substitutions, or a deletion or
insertion of one or more nucleotides. The wild type
sequence has a cytosine (C) at position 154,471; as
non-limiting examples, a JW18 variant allele can have a
cytosine (C) to guanine (G), cytosine to adenine (A), or
cytosine to thymine (T) substitution at nucleotide 154,471.
In one embodiment, the JW18 variant allele is a change from
a cytosine to a thymine at nucleotide 154,471.
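
The single-base variants defined above can be collected into
a small lookup table. The sketch below records, for each
named site, the AC007728 position, the wild type base and the
substitution named in the stated embodiment, all as given in
the text; the function classify and its return labels are
hypothetical. SNP 13 is omitted because it is an insertion
rather than a substitution. For SNP 8 and SNP 12 the text
defines only the "1" and "2" alleles, so the third label is
relevant only to the JW variants.

```python
# name: (AC007728 position, wild type base, base in the stated embodiment)
VARIANTS = {
    "SNP 8": (138991, "C", "T"),
    "SNP 12": (128377, "G", "C"),
    "JW1": (128143, "C", "T"),
    "JW15": (118790, "A", "C"),
    "JW16": (118031, "G", "A"),
    "JW17": (154688, "C", "T"),
    "JW18": (154471, "C", "T"),
}


def classify(name, observed_base):
    """Label the base observed at a named site (hypothetical helper)."""
    _position, wild_type, embodiment = VARIANTS[name]
    base = observed_base.upper()
    if len(base) != 1 or base not in "ACGT":
        raise ValueError(f"not a single nucleotide: {observed_base!r}")
    if base == wild_type:
        return "wild type"
    if base == embodiment:
        return "variant (stated embodiment)"
    return "variant (other substitution)"


for site, base in [("JW15", "c"), ("JW16", "G"), ("JW1", "A")]:
    print(site, base, classify(site, base))
```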

Further provided herein is a method of diagnosing
or predicting susceptibility to a clinical subtype of
Crohn's disease characterized by fibrostenosing disease by
determining the presence or absence in an individual of at
least two fibrostenosis-predisposing alleles linked to a
NOD2/CARD15 locus, where the presence of one or more of the
fibrostenosis-predisposing alleles is diagnostic of or
predictive of susceptibility to the clinical subtype of
Crohn's disease characterized by fibrostenosing disease. In
a method of the invention, the at least two
fibrostenosis-predisposing alleles are "2" alleles at SNP 8,
SNP 12 or SNP 13.



Further provided herein is a method of diagnosing
or predicting susceptibility to a clinical subtype of
Crohn's disease characterized by fibrostenosing disease by
determining the presence or absence in an individual of at
least two fibrostenosis-predisposing alleles linked to a
NOD2/CARD15 locus, where the presence of one or more of the
fibrostenosis-predisposing alleles is diagnostic of or
predictive of susceptibility to the clinical subtype of
Crohn's disease characterized by fibrostenosing disease. In
one embodiment, the at least two fibrostenosis-predisposing
alleles are "2" alleles at SNP 8, SNP 12, or SNP 13. In
another embodiment, a method of the invention for diagnosing
or predicting susceptibility to a clinical subtype of
Crohn's disease characterized by fibrostenosing disease is
practiced by determining the presence or absence in the
individual of (i) a "2" allele at SNP 8, (ii) a "2" allele
at SNP 12, and (iii) a "2" allele at SNP 13, where the
presence of one or more of the "2" alleles at SNP 8, SNP 12,
and SNP 13 is diagnostic of or predictive of susceptibility
to the clinical subtype of Crohn's disease characterized by
fibrostenosing disease.
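
The decision rule in the preceding embodiment (all three
sites are examined, and one or more "2" alleles gives a
positive result) can be written down directly. The following
is a minimal sketch; the function name and the representation
of a genotype as a pair of allele labels are assumptions made
for illustration.

```python
REQUIRED_SITES = ("SNP 8", "SNP 12", "SNP 13")


def has_predisposing_allele(genotypes):
    """genotypes maps each site to its two alleles, e.g. ("1", "2").

    Returns True if a "2" allele is present at one or more of the
    three sites. Every site must have been determined.
    """
    missing = [site for site in REQUIRED_SITES if site not in genotypes]
    if missing:
        raise ValueError(f"genotype not determined for: {missing}")
    return any("2" in genotypes[site] for site in REQUIRED_SITES)


carrier = {"SNP 8": ("1", "1"), "SNP 12": ("1", "2"), "SNP 13": ("1", "1")}
non_carrier = {site: ("1", "1") for site in REQUIRED_SITES}
print(has_predisposing_allele(carrier))
print(has_predisposing_allele(non_carrier))
```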

The strength of the association between a
fibrostenosis-predisposing allele and Crohn's disease can
be characterized by a particular odds ratio such as an odds
ratio of at least 2 with a lower 95% confidence interval
limit of greater than 1. Such an odds ratio can be, for
example, at least 3.0, 4.0, 5.0, 6.0, 7.0, or 8.0 or greater
with a lower 95% confidence interval limit of greater
than 1, such as an odds ratio of at least 7.4 with a 95%
confidence interval of 1.9-28.9 (see Figure 3). In
addition, an odds ratio can be, for example, at least 2.37
with a 95% confidence interval of 1.26-4.47 (see
Figure 3). In one embodiment, the fibrostenosis-
predisposing allele is associated with the clinical subtype
of Crohn's disease characterized by fibrostenosing disease
with an odds ratio of at least 2 and a lower 95% confidence
limit greater than 1. Methods for determining an odds ratio
are well known in the art (see, for example, Schlesselman et
al., Case Control Studies: Design, Conduct and Analysis
Oxford University Press, New York (1982)).
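
As a worked illustration of the threshold described above, an
odds ratio and an approximate 95% confidence interval can be
computed from a 2x2 table of allele carriage by disease
subtype. The sketch below uses the standard log (Woolf)
approximation, which is one common method and is not stated
to be the method used for Figure 3; the counts are
hypothetical.

```python
import math


def odds_ratio_ci(a, b, c, d, z=1.96):
    """Odds ratio and approximate 95% CI (log/Woolf method).

    a: fibrostenosing cases carrying the allele
    b: fibrostenosing cases not carrying the allele
    c: comparison subjects carrying the allele
    d: comparison subjects not carrying the allele
    """
    if min(a, b, c, d) <= 0:
        raise ValueError("all four cells must be positive for this method")
    odds_ratio = (a * d) / (b * c)
    se_log = math.sqrt(1 / a + 1 / b + 1 / c + 1 / d)
    lower = math.exp(math.log(odds_ratio) - z * se_log)
    upper = math.exp(math.log(odds_ratio) + z * se_log)
    return odds_ratio, lower, upper


def meets_threshold(a, b, c, d, min_odds_ratio=2.0):
    """True if the odds ratio is at least min_odds_ratio and the
    lower 95% confidence limit is greater than 1."""
    odds_ratio, lower, _upper = odds_ratio_ci(a, b, c, d)
    return odds_ratio >= min_odds_ratio and lower > 1.0


# Hypothetical counts, for illustration only.
or_value, ci_low, ci_high = odds_ratio_ci(30, 20, 20, 40)
print(round(or_value, 2), round(ci_low, 2), round(ci_high, 2))
print(meets_threshold(30, 20, 20, 40))
```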

In one embodiment, a fibrostenosis-predisposing
allele is associated with the fibrostenosing subtype of
Crohn's disease with a p value of equal to or less than
0.05. In other embodiments, a fibrostenosis-predisposing
allele is associated with the fibrostenosing subtype of
Crohn's disease with a p value of equal to or less than 0.1.
As used herein, the term "p value" is synonymous with
"probability value." As is well known in the art, the
expected p value for the association between a random allele
and disease is 1.00. A p value of less than about 0.05
indicates that the allele and disease do not appear together
by chance but are influenced by positive factors.
Generally, the statistical threshold for significance of
linkage has been set at a level of allele sharing for which
false positives would occur once in twenty genome scans
(p=0.05). In particular embodiments, a
fibrostenosis-predisposing allele is associated with a
clinical subtype of Crohn's disease characterized by
fibrostenosis with a p value of equal to or less than 0.1,
0.05, 0.04, 0.03, 0.02, 0.01, 0.009, 0.008, 0.007, 0.006,
0.005, 0.004, 0.003, 0.002 or 0.001, or with a p value of
less than 0.00095, 0.0009, 0.00085, 0.0008 or 0.0005. It is
recognized that, in some cases, p values may need to be
corrected, for example, to account for factors such as
sample size (number of families), genetic heterogeneity,
clinical heterogeneity, or analytical approach (parametric
or nonparametric method).
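
For a concrete sense of how a p value for an allele-subtype
association might be obtained from a 2x2 table, the sketch
below implements a two-sided Fisher's exact test with the
standard library. Fisher's exact test is used here only as a
familiar example; the text does not state which test the
inventors applied, and the counts are hypothetical.

```python
from math import comb


def fisher_exact_two_sided(a, b, c, d):
    """Two-sided Fisher's exact p value for the table [[a, b], [c, d]].

    Sums the probabilities of all tables with the same margins that
    are no more probable than the observed table.
    """
    row1, row2 = a + b, c + d
    col1 = a + c
    total = row1 + row2

    def prob(x):
        # Hypergeometric probability of x in the top-left cell.
        return comb(row1, x) * comb(row2, col1 - x) / comb(total, col1)

    lowest = max(0, col1 - row2)
    highest = min(row1, col1)
    observed = prob(a)
    p_value = sum(
        prob(x)
        for x in range(lowest, highest + 1)
        if prob(x) <= observed * (1 + 1e-9)
    )
    return min(1.0, p_value)


# Hypothetical counts: rows are subtype present / absent,
# columns are allele carried / not carried.
p_small = fisher_exact_two_sided(3, 1, 1, 3)
p_large = fisher_exact_two_sided(30, 20, 20, 40)
print(p_small, p_large, p_large <= 0.05)
```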

A variety of means can be used to determine the
presence or absence of a fibrostenosis-predisposing allele
in a method of the invention. As an example, enzymatic
amplification of nucleic acid from an individual can be
conveniently used to obtain nucleic acid for subsequent
analysis. The presence or absence of a
fibrostenosis-predisposing allele also can be determined
directly from the individual's nucleic acid without
enzymatic amplification.

Analysis of nucleic acid from an individual,
whether amplified or not, can be performed using any of
various techniques. Useful techniques include, without
limitation, polymerase chain reaction based analysis,
sequence analysis and electrophoretic analysis, which can be
used alone or in combination. As used herein, the term
"nucleic acid" means a polynucleotide such as a single- or
double-stranded DNA or RNA molecule including, for example,
genomic DNA, cDNA and mRNA. The term nucleic acid
encompasses nucleic acid molecules of both natural and
synthetic origin as well as molecules of linear, circular or
branched configuration representing either the sense or
antisense strand, or both, of a native nucleic acid
molecule. It is understood that such nucleic acid can be
attached to a synthetic material such as a bead or column
matrix.

Determining the presence or absence of a
fibrostenosis-predisposing allele can involve amplification
of an individual's nucleic acid by the polymerase chain
reaction. Use of the polymerase chain reaction for the
amplification of nucleic acids is well known in the art
(see, for example, Mullis et al. (Eds.), The Polymerase
Chain Reaction, Birkhauser, Boston, (1994)). In one
embodiment, polymerase chain reaction amplification is
performed using one or more fluorescently labeled primers.
In another embodiment, polymerase chain reaction
amplification is performed using one or more labeled or
unlabeled primers that contain a DNA minor groove binder.

Several different primers can be used to amplify
an individual's nucleic acid by the polymerase chain
reaction. For example, the PCR primers listed in Table 2
(SEQ ID NOS: 37-44) and shown in Figures 5 to 8 can be used
to amplify specific regions in the NOD2/CARD15 locus of an
individual's nucleic acid. For example, the region
surrounding SNP 8 can be amplified using SEQ ID NOS: 39 and
40; SNP 12 can be amplified using SEQ ID NOS: 41 and 42, and
the region surrounding SNP 13 can be amplified using SEQ ID
NOS: 43 and 44. In addition, for example, the region
surrounding. As understood by one skilled in the art,
additional primers for PCR analysis can be designed based on
the sequence flanking the region of interest. As a non-
limiting example, a sequence primer can contain about 15 to
30 nucleotides of a sequence upstream or downstream of the
region of interest. Such primers are generally designed to
have sufficient guanine and cytosine content to attain a
high melting temperature which allows for a stable annealing
step in the amplification reaction. Several computer
programs, such as Primer Select, are available to aid in the
design of PCR primers.
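
The primer properties mentioned above (a length of about 15
to 30 nucleotides and sufficient guanine and cytosine content
for a high melting temperature) can be checked with a few
lines of code. The sketch below uses the Wallace rule,
Tm = 2(A+T) + 4(G+C) degrees C, a rough rule of thumb for
short oligonucleotides; it is not the method used by Primer
Select, and the primer sequence shown is hypothetical rather
than one of SEQ ID NOS: 37-44.

```python
def gc_fraction(primer):
    """Fraction of bases in the primer that are G or C."""
    seq = primer.upper()
    return sum(seq.count(base) for base in "GC") / len(seq)


def wallace_tm(primer):
    """Rough melting temperature in degrees C by the Wallace rule."""
    seq = primer.upper()
    at = seq.count("A") + seq.count("T")
    gc = seq.count("G") + seq.count("C")
    return 2 * at + 4 * gc


def length_in_range(primer, min_len=15, max_len=30):
    """True if the primer length lies in the range given in the text."""
    return min_len <= len(primer) <= max_len


toy_primer = "AGCTGACCGTTAGCCATGCA"  # hypothetical 20-mer
print(length_in_range(toy_primer), gc_fraction(toy_primer), wallace_tm(toy_primer))
```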

A TaqMan® allelic discrimination assay available
from Applied Biosystems can be useful for determining the
presence or absence of a fibrostenosis-predisposing allele.
In a TaqMan® allelic discrimination assay, a specific,
fluorescent, dye-labeled probe for each allele is
constructed. The probes contain different fluorescent
reporter dyes such as FAM and VIC™ to differentiate the
amplification of each allele. In addition, each probe has a
quencher dye at one end which quenches fluorescence by
fluorescence resonance energy transfer (FRET). During PCR,
each probe anneals specifically to complementary sequences
in the nucleic acid from the individual. The 5' nuclease
activity of Taq polymerase is used to cleave only probes
that hybridize to the allele. Cleavage separates the reporter
dye from the quencher dye, resulting in increased
fluorescence by the reporter dye. Thus, the fluorescence
signal generated by PCR amplification indicates which
alleles are present in the sample. Mismatches between a
probe and allele reduce the efficiency of both probe
hybridization and cleavage by Taq polymerase, resulting in
little to no fluorescent signal. Improved specificity in
allelic discrimination assays can be achieved by conjugating
a DNA minor groove binder (MGB) group to a DNA probe as
described, for example, in Kutyavin et al., Nuc. Acids
Research 28:655-661 (2000). Minor groove binders include,
but are not limited to, compounds such as
dihydrocyclopyrroloindole tripeptide (DPI3).
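
The readout logic of a two-probe allelic discrimination
assay can be sketched as follows: signal from one reporter
alone indicates a homozygote for that allele, and signal from
both indicates a heterozygote. The function name, the single
shared threshold and the assignment of one reporter dye to
each allele are assumptions for illustration; in practice
thresholds are set from controls run with the assay.

```python
def call_genotype(allele1_signal, allele2_signal, threshold=1.0):
    """Call a genotype from two end-point reporter signals.

    allele1_signal: normalized signal of the reporter on the "1" probe
    allele2_signal: normalized signal of the reporter on the "2" probe
    threshold: hypothetical cut-off separating signal from background
    """
    allele1_present = allele1_signal >= threshold
    allele2_present = allele2_signal >= threshold
    if allele1_present and allele2_present:
        return "1/2"
    if allele1_present:
        return "1/1"
    if allele2_present:
        return "2/2"
    return "no call"


# Hypothetical normalized signals for four wells.
for signals in [(2.4, 0.1), (1.8, 1.7), (0.2, 2.9), (0.1, 0.2)]:
    print(signals, call_genotype(*signals))
```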



Sequence analysis also can be useful for
determining the presence or absence of a fibrostenosis-
predisposing allele in a method of the invention. A
fibrostenosis-predisposing allele can be detected by
sequence analysis using primers disclosed herein, for
example, as the PCR primers listed in Table 2 (SEQ ID
NOS: 37-44) and shown in Figures 5 to 8. As understood by
one skilled in the art, additional primers for sequence
analysis can be designed based on the sequence flanking the
region of interest. As a non-limiting example, a sequence
primer can contain about 15 to 30 nucleotides of a sequence
about 40 to 400 base pairs upstream or downstream of the
region of interest. Such primers are generally designed to
have sufficient guanine and cytosine content to attain a
high melting temperature which allows for a stable annealing
step in the sequencing reaction.

The term "sequence analysis," as used herein in
reference to one or more nucleic acids, means any manual or
automated process by which the order of nucleotides in the
nucleic acid is determined. As an example, sequence
analysis can be used to determine the nucleotide sequence of
a sample of DNA. The term sequence analysis encompasses,
without limitation, chemical methods such as Maxam-Gilbert
sequencing and dideoxy enzymatic methods such as Sanger
sequencing, as well as variations thereof. The term
sequence analysis further encompasses,
but is not limited to, capillary array DNA sequencing, which
relies on capillary electrophoresis and laser-induced
fluorescence detection and can be performed using
instruments such as the MegaBACE 1000 or ABI 3700. As
additional non-limiting examples, the term sequence analysis
encompasses thermal cycle sequencing (Sears et al.,
Biotechniques 13:626-633 (1992)); solid-phase sequencing
(Zimmerman et al., Methods Mol. Cell Biol. 3:39-42 (1992));
and sequencing with mass spectrometry such as
matrix-assisted laser desorption/ionization time-of-flight
mass spectrometry MALDI-TOF MS (Fu et al., Nature Biotech.
16:381-384 (1998)). The term sequence analysis also
includes, yet is not limited to, sequencing by hybridization
(SBH), which relies on an array of all possible short
oligonucleotides to identify a segment of sequences present
in an unknown DNA (Chee et al., Science 274:610-614 (1996);
Drmanac et al., Science 260:1649-1652 (1993); and Drmanac et
al., Nature Biotech. 16:54-58 (1998)). One skilled in the
art understands that these and additional variations are
encompassed by the term sequence analysis as defined herein.
See, in general, Ausubel et al., supra, Chapter 7 and
supplement 47.

The invention also provides a method of diagnosing
or predicting susceptibility to a clinical subtype of
Crohn's disease characterized by fibrostenosing disease by
determining the presence or absence in an individual of a
fibrostenosis-predisposing allele linked to a NOD2/CARD15
locus, where the presence of the fibrostenosis-predisposing
allele is diagnostic of or predictive of susceptibility to
the clinical subtype of Crohn's disease characterized by
fibrostenosing disease, and where the method includes the
step of obtaining material containing nucleic acid
including the NOD2/CARD15 locus from the individual. As
used herein, the term "material" means any biological matter
from which nucleic acid can be prepared. As non-limiting
examples, the term material encompasses whole blood, plasma,
saliva, cheek swab, or other bodily fluid or tissue that
contains nucleic acid. In one embodiment, a method of the
invention is practiced with whole blood, which can be
obtained readily by non-invasive means and used to prepare
genomic DNA, for example, for enzymatic amplification or
automated sequencing. In another embodiment, a method of
the invention is practiced with tissue obtained from an
individual such as tissue obtained during surgery or biopsy
procedures.

Electrophoretic analysis also can be useful in the
methods of the invention. Electrophoretic analysis, as used
herein in reference to one or more nucleic acids such as
amplified fragments, means a process whereby charged
molecules are moved through a stationary medium under the
influence of an electric field. Electrophoretic migration
separates nucleic acids primarily on the basis of their
charge, which is in proportion to their size, with smaller
molecules migrating more quickly. The term electrophoretic
analysis includes, without limitation, analysis using slab
gel electrophoresis, such as agarose or polyacrylamide gel
electrophoresis, or capillary electrophoresis. Capillary
electrophoretic analysis generally occurs inside a
small-diameter (50-100 μm) quartz capillary in the presence
of high (kilovolt-level) separating voltages with separation
times of a few minutes. Using capillary electrophoretic
analysis, nucleic acids are conveniently detected by UV
absorption or fluorescent labeling, and single-base
resolution can be obtained on fragments up to several
hundred base pairs. Such methods of electrophoretic
analysis, and variations thereof, are well known in the art,
as described, for example, in Ausubel et al., Current
Protocols in Molecular Biology Chapter 2 (Supplement 45)
John Wiley & Sons, Inc. New York (1999).



Restriction fragment length polymorphism (RFLP)
analysis also can be useful for determining the presence or
absence of a fibrostenosis-predisposing allele in a method
of the invention (Jarcho et al. in Dracopoli et al., Current
Protocols in Human Genetics pages 2.7.1-2.7.5, John Wiley &
Sons, New York; Innis et al. (Ed.), PCR Protocols, San
Diego: Academic Press, Inc. (1990)). As used herein,
restriction fragment length polymorphism analysis is any
method for distinguishing genetic polymorphisms using a
restriction enzyme, which is an endonuclease that catalyzes
the degradation of nucleic acid and recognizes a specific
base sequence, generally a palindrome or inverted repeat.
One skilled in the art understands that the use of RFLP
analysis depends upon an enzyme that can differentiate two
alleles at a polymorphic site.
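
The dependence of RFLP analysis on an enzyme that
distinguishes the two alleles can be illustrated with a small
in-silico digest. In the sketch below, a single-base change
destroys an EcoRI recognition site (GAATTC, cut after the
first G on the strand shown), so the two alleles give
different fragment lengths. The two 26-nucleotide sequences
are hypothetical toy examples, not NOD2/CARD15 sequence, and
EcoRI is chosen only because its site is widely known.

```python
def fragment_lengths(sequence, site, cut_offset):
    """Lengths of fragments after cutting a linear sequence at every
    occurrence of site, cut_offset bases into the site."""
    cuts = []
    start = sequence.find(site)
    while start != -1:
        cuts.append(start + cut_offset)
        start = sequence.find(site, start + 1)
    bounds = [0] + cuts + [len(sequence)]
    return [bounds[i + 1] - bounds[i] for i in range(len(bounds) - 1)]


ECORI_SITE, ECORI_OFFSET = "GAATTC", 1

# Hypothetical toy alleles differing by one base inside the site.
toy_allele_1 = "ACGTACGTAC" + "GAATTC" + "TTGACCATGG"
toy_allele_2 = "ACGTACGTAC" + "GAGTTC" + "TTGACCATGG"

print(fragment_lengths(toy_allele_1, ECORI_SITE, ECORI_OFFSET))
print(fragment_lengths(toy_allele_2, ECORI_SITE, ECORI_OFFSET))
```

The cut allele yields two fragments and the uncut allele one
full-length fragment, a difference that could then be
resolved by electrophoretic analysis.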

Allele-specific oligonucleotide hybridization also
can be used to detect the presence or absence of a
fibrostenosis-predisposing allele. Allele-specific
oligonucleotide hybridization is based on the use of a
labeled oligonucleotide probe having a sequence perfectly
complementary, for example, to the sequence encompassing a
fibrostenosis-predisposing allele. Under appropriate
conditions, the allele-specific probe hybridizes to a
nucleic acid containing the fibrostenosis-predisposing
allele but does not hybridize to the one or more other
alleles, which have one or more nucleotide mismatches as
compared to the probe. If desired, a second allele-specific
oligonucleotide probe that matches an alternate allele also
can be used. Similarly, the technique of allele-specific
oligonucleotide amplification can be used to selectively
amplify, for example, a fibrostenosis-predisposing allele by
using an allele-specific oligonucleotide primer that is
perfectly complementary to the nucleotide sequence of the
fibrostenosis-predisposing allele but which has one or more
mismatches as compared to other alleles (Mullis et al.,
supra, 1994). One skilled in the art understands that the
one or more nucleotide mismatches that distinguish between
the fibrostenosis-predisposing allele and one or more other
alleles are often located in the center of an
allele-specific oligonucleotide primer to be used in
allele-specific oligonucleotide hybridization. In contrast,
an allele-specific oligonucleotide primer to be used in PCR
amplification generally contains the one or more nucleotide
mismatches that distinguish between the disease-associated
and other alleles at the 3' end of the primer.
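
The contrast drawn above between the two oligonucleotide
designs (distinguishing base near the center for
hybridization, at the 3' end for amplification) can be made
concrete. The sketch below builds both from a template, a
polymorphic position and the allele to be matched. The
template, the position and the function names are
hypothetical; for simplicity the oligonucleotides are written
as the same-strand sequence, whereas a hybridization probe
would in practice be complementary to the strand it detects.

```python
def centered_probe(template, snp_index, allele, flank=8):
    """Oligonucleotide with the allele-specific base at its center."""
    if snp_index - flank < 0 or snp_index + flank >= len(template):
        raise ValueError("not enough flanking sequence")
    left = template[snp_index - flank:snp_index]
    right = template[snp_index + 1:snp_index + 1 + flank]
    return left + allele + right


def three_prime_primer(template, snp_index, allele, length=18):
    """Forward primer whose 3'-terminal base is the allele-specific base."""
    start = snp_index - (length - 1)
    if start < 0:
        raise ValueError("not enough upstream sequence")
    return template[start:snp_index] + allele


# Hypothetical 40-nucleotide template with a C at 0-based index 20.
toy_template = "GATTACAGGCTTCAAGCTGACCGTTAGCCATGCAATCGGA"
probe = centered_probe(toy_template, 20, "T")
primer = three_prime_primer(toy_template, 20, "T")
print(probe)
print(primer)
```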

A heteroduplex mobility assay (HMA) is another
well known assay that can be used to detect the presence or
absence of a fibrostenosis-predisposing allele in a method
of the invention. HMA is useful for detecting the presence
of a polymorphic sequence since a DNA duplex carrying a
mismatch has reduced mobility in a polyacrylamide gel
compared to the mobility of a perfectly base-paired duplex
(Delwart et al., Science 262:1257-1261 (1993); White et al.,
Genomics 12:301-306 (1992)).

The technique of single strand conformational
polymorphism (SSCP) also can be used to detect the presence
or absence of a fibrostenosis-predisposing allele in a
method of the invention (see Hayashi, Methods Applic.
1:34-38 (1991)). This technique is used to detect mutations
based on differences in the secondary structure of
single-strand DNA that produce an altered electrophoretic
mobility upon non-denaturing gel electrophoresis.
Polymorphic fragments are detected by comparison of the
electrophoretic pattern of the test fragment to
corresponding standard fragments containing known alleles.

Denaturing gradient gel electrophoresis (DGGE)
also can be used to detect a fibrostenosis-predisposing
allele in a method of the invention. In DGGE,
double-stranded DNA is electrophoresed in a gel containing
an increasing concentration of denaturant; double-stranded
fragments made up of mismatched alleles have segments that
melt more rapidly, causing such fragments to migrate
differently as compared to perfectly complementary sequences
(Sheffield et al., "Identifying DNA Polymorphisms by
Denaturing Gradient Gel Electrophoresis" in Innis et al.,
supra, 1990).

Other molecular methods useful for determining the
presence or absence of a fibrostenosis-predisposing allele
are known in the art and useful in the methods of the
invention. Other well-known approaches for determining the
presence or absence of a fibrostenosis-predisposing allele
include, without limitation, automated sequencing and RNAase
mismatch techniques (Winter et al., Proc. Natl. Acad. Sci.
82:7575-7579 (1985)). Furthermore, one skilled in the art
understands that, where the presence or absence of multiple
alleles or a fibrostenosis-predisposing haplotype is to be
determined, individual alleles can be detected by any
combination of molecular methods. See, in general, Birren
et al. (Eds.) Genome Analysis: A Laboratory Manual Volume 1
(Analyzing DNA) New York, Cold Spring Harbor Laboratory
Press (1997). In addition, one skilled in the art
understands that multiple alleles can be detected in
individual reactions or in a single reaction (a "multiplex"
assay). In view of the above, one skilled in the art
realizes that the methods of the invention for diagnosing or
predicting susceptibility to a clinical subtype of Crohn's
disease characterized by fibrostenosing disease can be
practiced using one or any combination of the well known
assays described above or known in the art.

The present invention further provides a method of
diagnosing or predicting susceptibility to a clinical
subtype of Crohn's disease characterized by fibrostenosing
disease by determining the presence or absence in an
individual of a fibrostenosis-predisposing haplotype, where
the presence of the fibrostenosis-predisposing haplotype is
diagnostic of or predictive of susceptibility to the
clinical subtype of Crohn's disease characterized by
fibrostenosing disease. In one embodiment, the
fibrostenosis-predisposing haplotype is associated with a
clinical subtype of Crohn's disease characterized by
fibrostenosing disease with an odds ratio of at least 2 and
a lower 95% confidence limit greater than 1. In another
embodiment, the fibrostenosis-predisposing haplotype is
associated with a clinical subtype of Crohn's disease
characterized by fibrostenosing disease with an odds ratio
of at least 3, at least 4, at least 5, or at least 6, and a
lower 95% confidence limit greater than 1.
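
As a non-limiting illustration of the odds ratio criterion
recited above, the following Python sketch computes an odds
ratio and an approximate 95% confidence interval from a 2x2
table. The function name and the use of the log-based (Woolf)
interval are assumptions of this sketch; the specification does
not state which interval method was used. The example counts are
back-calculated from the carrier percentages and group sizes
reported in Tables 5 and 6 of the Examples and are therefore
approximate.

```python
import math


def odds_ratio_ci(a, b, c, d, z=1.96):
    """Odds ratio with a Woolf (log-based) confidence interval.

    a: carriers with the phenotype      b: non-carriers with the phenotype
    c: carriers without the phenotype   d: non-carriers without the phenotype
    z: normal quantile (1.96 for an approximate 95% interval)
    """
    if min(a, b, c, d) <= 0:
        raise ValueError("all four cells must be positive for this interval")
    odds_ratio = (a * d) / (b * c)
    se_log_or = math.sqrt(1 / a + 1 / b + 1 / c + 1 / d)
    lower = math.exp(math.log(odds_ratio) - z * se_log_or)
    upper = math.exp(math.log(odds_ratio) + z * se_log_or)
    return odds_ratio, lower, upper


# Approximate counts derived from Tables 5 and 6 (both cohorts combined):
# fibrostenosing: 48 carriers, 55 non-carriers; other: 23 carriers, 75 non-carriers.
odds, lower, upper = odds_ratio_ci(48, 55, 23, 75)

# Criterion of the first embodiment: odds ratio >= 2 and lower limit > 1.
meets_criterion = odds >= 2 and lower > 1
print(round(odds, 2), round(lower, 2), round(upper, 2), meets_criterion)
```

The interval obtained this way can differ slightly from one
computed with an exact or stratified method, so the sketch is
intended only to show how the two-part criterion (odds ratio and
lower confidence limit) is evaluated.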



The term "fibrostenosis-predisposing haplotype,"
as used herein, means a combination of alleles that tend to
be inherited together with the clinical subtype of Crohn's
disease characterized by fibrostenosing disease more often
than would be expected according to traditional Mendelian
genetics. In a method of the invention, the clinical
subtype of Crohn's disease can be, for example,
characterized by fibrostenosing disease independent of small
bowel involvement. In one embodiment, the
fibrostenosis-predisposing haplotype includes at least one
allele linked to the NOD2/CARD15 locus. In another
embodiment, the fibrostenosis-predisposing haplotype
includes a fibrostenosis-predisposing allele. In a further
embodiment, a fibrostenosis-predisposing haplotype contains,
for example, a variant allele at the NOD2/CARD15 locus. In
another embodiment, the fibrostenosis-predisposing haplotype
includes the "2" allele at SNP 8, SNP 12, or SNP 13. In a
further embodiment, the fibrostenosis-predisposing haplotype
includes the "2" allele at SNP 13. In still further
embodiments, the fibrostenosis-predisposing haplotype
includes the "2" allele at SNP 8, SNP 12, and SNP 13. In
another embodiment, the fibrostenosis-predisposing haplotype
includes a JW1, JW15, JW16, JW17, or JW18 variant allele.
One skilled in the art understands that a
fibrostenosis-predisposing haplotype can contain alleles
that individually are not significantly associated with the
fibrostenosing subtype of Crohn's disease, so long as the
combination of alleles making up the haplotype tends to be
inherited together with the fibrostenosing subtype of
Crohn's disease more often than would be expected according
to traditional Mendelian genetics.

The presence or absence of a
fibrostenosis-predisposing haplotype can be determined
using any of the methods described herein above for
determining the presence or absence of a
fibrostenosis-predisposing allele. As an example, enzymatic
amplification such as polymerase chain reaction
amplification, for example, using one or more fluorescently
labeled probes or one or more probes containing a DNA minor
groove binder can be useful for determining the presence or
absence of a fibrostenosis-predisposing haplotype in a
method of the invention.

Antibody based methods also can be useful for
determining the presence or absence of a
fibrostenosis-predisposing allele or
fibrostenosis-predisposing haplotype in a method of the
invention. As an example, an antibody that is specifically
reactive with a polypeptide or fragment thereof encoded by a
fibrostenosis-predisposing allele can be used to detect the
presence or absence of that allele in an individual. Such
an antibody can be, for example, specifically reactive with
the truncated version of NOD2/CARD15 generated by a "2"
allele at SNP 13 but not reactive with full-length or wild
type NOD2/CARD15.



Antibodies useful in the methods of the invention
include, without limitation, monoclonal and polyclonal
antibodies, single chain antibodies, chimeric antibodies,
bifunctional or bispecific antibodies, humanized antibodies,
human antibodies, and complementarity determining region
(CDR)-grafted antibodies, including compounds which include
CDR or antigen-binding sequences, which differentially bind
to a polypeptide or fragment encoded by a
fibrostenosis-predisposing allele but not to other,
non-predisposing alleles. Antibody fragments, including Fab,
Fab', F(ab')2, and Fv, also can be useful in the methods of
the invention as can plastic antibodies or molecularly
imprinted polymers (MIPs; Haupt and Mosbach, TIBTech
16:468-475 (1998)). Screening assays to determine
differential binding specificity of an antibody are well
known in the art (see Harlow et al. (Eds), Antibodies: A
Laboratory Manual; Cold Spring Harbor Laboratory; Cold
Spring Harbor, N.Y. (1988)).

Antibodies useful in a method of the invention can
be produced using any method well known in the art, using a
polypeptide, or immunogenic fragment thereof, encoded by a
fibrostenosis-predisposing allele. Immunogenic polypeptides
or fragments can be isolated from natural sources, from
recombinant host cells, or can be chemically synthesized.
Methods for synthesizing such peptides are known in the art
as described, for example, in Merrifield, J. Amer. Chem.
Soc. 85:2149-2154 (1963); Krstenansky et al., FEBS Lett.
211:10 (1987).

Antibodies that differentially bind to
polypeptides encoded by fibrostenosis-predisposing alleles
of the invention can be labeled with a detection moiety and
used to detect the presence, absence or amount of the
encoded polypeptide in vivo, in vitro, or in situ. A
moiety, such as a fluorescent molecule, can be linked to an
antibody for use in a method of the invention. A moiety
such as a detection moiety can be linked to an antibody
using, for example, carbodiimide conjugation (Bauminger and
Wilchek, Meth. Enzymol. 70:151-159 (1980)).

Antibodies that differentially bind to
polypeptides encoded by a fibrostenosis-predisposing allele
can be used in immunoassays. Immunoassays include, without
limitation, radioimmunoassays, enzyme-linked immunosorbent
assays (ELISAs) and immunoassays with fluorescently labeled
antibodies, which are well known in the art. Antibodies can
also be used to detect the presence or absence of a
polypeptide of interest in a cell or tissue using
immunohistochemistry or other in situ assays. Furthermore,
cells containing a polypeptide of interest either on the
surface of the cell or internally can be detected by an
antibody using assays such as fluorescence activated cell
sorting (FACS). One skilled in the art understands that
these and other routine assays can be useful for determining
the presence or absence of the gene product of a
fibrostenosis-predisposing allele according to a method of
the invention.

The methods of the invention optionally include
generating a report indicating the presence or absence in an
individual of a fibrostenosis-predisposing allele or
fibrostenosis-predisposing haplotype. The methods of the
invention also optionally include generating a report
indicating the presence or absence in the individual of a
clinical subtype of Crohn's disease characterized by
fibrostenosing disease or the risk that an individual has of
having or developing the fibrostenosing subtype of Crohn's
disease.

A report can be in a variety of forms; for
example, a report can be printed on paper or a report can be
an electronic report that is not printed but is transmitted
over an electronic medium such as electronic mail or a
computer diskette. A report also can be an oral report that
indicates the presence or absence in the individual of a
fibrostenosis-predisposing allele or a clinical subtype of
Crohn's disease characterized by fibrostenosing disease.

The invention also provides a method of optimizing
therapy in an individual by determining the presence or
absence in the individual of a fibrostenosis-predisposing
allele linked to a NOD2/CARD15 locus, diagnosing individuals
in which the fibrostenosis-predisposing allele is present as
having a fibrostenosing subtype of Crohn's disease, and
treating the individual having a fibrostenosing subtype of
Crohn's disease based on the diagnosis. Treatment for the
fibrostenosing subtype of Crohn's disease currently
includes, for example, surgical removal of the affected,
strictured part of the bowel. In one embodiment, the
presence or absence of a fibrostenosis-predisposing allele
is determined in an individual having a known diagnosis of
Crohn's disease. In another embodiment, the diagnosis is
recorded in the form of a report as described above.

The following examples are intended to illustrate
but not limit the present invention.

EXAMPLE I

SELECTION AND CHARACTERIZATION OF STUDY SUBJECTS

This example describes the clinical
characterization of two cohorts of Crohn's disease
patients.

A. Selection of Study Subjects

Two cohorts of Crohn's disease patients were
consecutively identified from an inflammatory bowel disease
referral center (Cedars-Sinai Medical Center Inflammatory
Bowel Disease Center). The first cohort of 142 patients,
ascertained between 1993-1996 and designated "CD1", was
previously described in Vasiliauskas et al., Gut 47:487-496
(2000). The second cohort of 59 patients (Cohort 2) was
collected between 1999-2001 and designated "CD2". A cohort
of 175 patients with ulcerative colitis was used as an
inflammatory bowel disease control group. This study, which
was reviewed and approved for human subject participation by
the Cedars-Sinai Institutional Review Board, involved the
collection of clinical, serologic and genetic data from
patients consenting to the study.

A diagnosis of Crohn's disease in the cohort
patients was defined by the presence of a combination of
established features from at least two of the following
categories: 1) clinical - perforating or fistulizing
disease, obstructive symptoms secondary to small bowel
stenosis or stricture; 2) endoscopic - deep linear or
serpiginous ulcerations, discrete ulcers in normal-appearing
mucosa, cobblestoning, or discontinuous or asymmetric
inflammation; 3) radiographic - segmental disease (skip
lesions), small bowel or colon strictures, stenosis, or
fistula; and 4) histopathologic - submucosal or transmural
inflammation, multiple granulomas, marked focal cryptitis or
focal chronic inflammatory infiltration within and between
biopsies, or skip lesions including rectal sparing in the
absence of local therapy.

To identify clinical features and immunological
traits associated with allelic variants of the NOD2/CARD15
gene, the study was designed to analyze two consecutively
ascertained cohorts of patients with Crohn's disease. The
first cohort was used to explore the relationship of
NOD2/CARD15 alleles with an array of clinical and serologic
variables, thereby generating hypotheses. The second cohort
was then used to confirm the specific hypotheses generated
from analysis of the first cohort. To minimize type I error
and maximize statistical power, the significance level for
the associations in the first cohort was permitted to be less
stringent (p<0.1) than in the second cohort (p<0.05). By
avoiding a highly stringent correction for the number of
variables examined in the first cohort, there was an
increased ability to identify specific associations between
NOD2/CARD15 allele variants and clinical variables.
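
The two-stage design described above can be sketched, purely
for illustration, as a screening step followed by a confirmation
step. The function name and data layout are assumptions of this
sketch; the p-values shown are those reported for carriage of
any allelic variant in Tables 5 and 6 and in Example III.

```python
def two_stage_screen(stage1_p, stage2_p, alpha1=0.1, alpha2=0.05):
    """Screen traits in a first cohort, then confirm them in a second.

    stage1_p, stage2_p: dicts mapping a trait name to its p-value.
    A trait becomes a candidate if p < alpha1 in the first cohort and is
    confirmed only if it also has p < alpha2 in the second cohort.
    """
    candidates = [trait for trait, p in stage1_p.items() if p < alpha1]
    confirmed = [trait for trait in candidates
                 if trait in stage2_p and stage2_p[trait] < alpha2]
    return candidates, confirmed


# p-values for carriage of any allelic variant (Cohort 1, then Cohort 2).
cohort1 = {
    "fibrostenosing disease": 0.049,
    "small bowel involvement": 0.063,
    "perianal perforating disease": 0.747,
    "pANCA positive": 0.529,
}
cohort2 = {
    "fibrostenosing disease": 0.002,
    "small bowel involvement": 0.354,
}

candidates, confirmed = two_stage_screen(cohort1, cohort2)
print(candidates)
print(confirmed)
```

Only traits passing the lenient first-stage threshold are carried
forward, so the stricter second-stage threshold is applied to a
small number of pre-specified hypotheses.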

B. Clinical Characterization of Study Subjects

Patients with Crohn's disease were characterized
as having fibrostenosing disease, internal-perforating
disease, perianal fistulizing disease or ulcerative colitis
(UC)-like disease based on previously described criteria
(Gasche et al., Inflammatory Bowel Diseases 6:8-15 (2000);
Vasiliauskas et al., Gut 47:487-496 (2000); Vasiliauskas et
al., Gastroenterology 110:1810-1819 (1996); and Greenstein
et al., Gut 29:588-592 (1988)). According to well
established criteria, fibrostenosing disease was defined by
documented persistent intestinal obstruction or an
intestinal resection for an intestinal obstruction.
Internal perforating disease was defined as current or
previous evidence of entero-enteric or entero-vesicular
fistulae, intraabdominal abscesses, or small bowel
perforation. Perianal perforating disease was defined by
current or previous evidence of either perianal fistulae or
abscesses or rectovaginal fistula. UC-like disease was
defined by current or previous evidence of left-sided
colonic involvement, symptoms of bleeding or urgency, and
crypt abscesses on colonic biopsies as previously described.
Disease location was classified as small bowel, colon, or
both based on one or more endoscopic, radiologic or
pathologic studies. A panel of inflammatory bowel disease
physicians unaware of the results of serologic and genetic
testing reached a consensus on phenotype based on the
clinical data.

Serum ANCA expression and ANCA subtype
characterization were performed by fixed neutrophil
enzyme-linked immunosorbent assay (ELISA) as previously
described (Saxon et al., J. Allergy Clin. Immunol.
86:202-210 (1990)). Briefly, human peripheral blood
neutrophils fixed with methanol were reacted with control
and coded sera at a 1:100 dilution. Anti-human
immunoglobulin G (IgG) (γ chain-specific) antibody (Jackson
Immunoresearch Labs, Inc.; West Grove, PA) conjugated to
alkaline phosphatase was added to label neutrophil-bound
antibody, and a colorimetric reaction was performed. Levels
were determined relative to a standard consisting of pooled
sera obtained from well-characterized pANCA+ UC patients.
Results were expressed as ELISA units (EU) per milliliter.
ANCA+ sera were further subtyped via indirect
immunofluorescent staining to determine the ANCA neutrophil
binding pattern, as previously described (Saxon et al.,
supra, 1990). Sera showing the characteristic perinuclear
highlighting and losing the characteristic staining pattern
when treated with deoxyribonuclease were termed pANCA+
(Vidrich et al., J. Clin. Immunol. 15:293-299 (1995)). For
the purposes of this study, patients were considered pANCA+
if they were both positive for ANCA by ELISA and lost
perinuclear immunofluorescence staining with
deoxyribonuclease treatment.

Sera were analyzed for ASCA expression in a
blinded fashion by using a fixed ELISA assay (Vasiliauskas
et al., Gut 47:487-496 (2000); and Vermeire et al.,
Gastroenterology 120:827-833 (2001)). Two patients in the
second cohort did not undergo ASCA testing. High-binding
polystyrene microtiter plates were coated with purified
phosphopeptidomannans extracted from yeast S. uvarum, a
subspecies of S. cerevisiae. Coded patient sera were
diluted and added to the wells, followed by an enzyme-linked
colorimetric reaction. Color development was proportional
to concentrations of ASCA antibody present in the sera.
Levels were determined and results expressed as EU per
milliliter, relative to a standard, which was derived from a
pool of patient sera with well-characterized Crohn's disease
found to have reactivity to this antigen. Sera showing ASCA
IgG reactivity of >40 EU/mL or IgA reactivity of >20 EU/mL
were termed ASCA+. Serological assays were performed at
Cedars-Sinai Medical Center and Prometheus Laboratories (San
Diego, CA) using substantially identical methodology. The
clinical characteristics of the two Crohn's disease cohorts
are shown in Table 1.
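
The serologic calls described in this example can be expressed,
as a hedged sketch only, as two simple rules. The class and
function names are hypothetical. Because the ELISA cutoff used
to call a serum ANCA-positive is not stated here, ANCA ELISA
positivity is taken as a yes/no input rather than computed from
a level.

```python
from dataclasses import dataclass

# ASCA cutoffs stated in the text (ELISA units per milliliter).
ASCA_IGG_CUTOFF_EU_PER_ML = 40.0
ASCA_IGA_CUTOFF_EU_PER_ML = 20.0


@dataclass
class SerologyResult:
    anca_elisa_positive: bool        # positive for ANCA by fixed-neutrophil ELISA
    perinuclear_staining: bool       # perinuclear pattern on immunofluorescence
    staining_lost_after_dnase: bool  # pattern lost with deoxyribonuclease
    asca_igg_eu_per_ml: float
    asca_iga_eu_per_ml: float


def is_panca_positive(result: SerologyResult) -> bool:
    """pANCA+: ANCA-positive by ELISA and DNase-sensitive perinuclear staining."""
    return (result.anca_elisa_positive
            and result.perinuclear_staining
            and result.staining_lost_after_dnase)


def is_asca_positive(result: SerologyResult) -> bool:
    """ASCA+: IgG above 40 EU/mL or IgA above 20 EU/mL."""
    return (result.asca_igg_eu_per_ml > ASCA_IGG_CUTOFF_EU_PER_ML
            or result.asca_iga_eu_per_ml > ASCA_IGA_CUTOFF_EU_PER_ML)


# Illustrative (hypothetical) serum: ASCA IgA above cutoff, ANCA negative.
example = SerologyResult(False, False, False, 12.0, 25.5)
print(is_panca_positive(example), is_asca_positive(example))
```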

TABLE 1

Clinical Characteristics of Two Crohn's Disease Cohorts

Clinical characteristics      CD1          CD2          P
                              n=142        n=59
Gender (M/F)                  79/63        33/26        [illegible]
Age at onset                  22 (4-67)    22 (2-58)    0.6621
Ethnicity (Jew/Non-Jew)       60/82        23/36        0.668
Disease location (%)
  SB only                     19.0         [illegible]  0.496
  Colon only                  20.4         20.8
  SB and Colon                60.6         52.8
Perianal perforating (%)      35.9         28.8         0.332
Internal perforating (%)      47.2         23.7         0.002
Fibrostenosing disease (%)    59.9         30.5         0.001
UC-like (%)                   39.4         22.0         0.018
pANCA-positive (%)            19.7         12.5         0.295
ASCA-positive* (%)            57.0         38.6         0.019



EXAMPLE II

PATIENTS WITH CROHN'S DISEASE HAVE AN INCREASED FREQUENCY OF
RARE ALLELIC VARIANTS OF NOD2/CARD15

This example describes the association of a "2"
allele at SNP 8, SNP 12, or SNP 13 within the NOD2/CARD15
locus (rare allelic variants of NOD2/CARD15) with Crohn's
disease in a North American population.

In order to determine whether the North American
Crohn's disease patient populations in Cohorts 1 and 2
expressed allelic variants of NOD2/CARD15, Cohort 1
(hypothesis-generating) and Cohort 2 (hypothesis-confirming)
were genotyped for the rare allelic variant of SNP 8
(R675W), SNP 12 (G881R) and SNP 13 (3020insC). A cohort of
ulcerative colitis patients was used for comparison.

Genotyping was performed using a genotyping assay
employing 5'-exonuclease technology, the TaqMan MGB™ assay
(PE Biosystems; Foster City, CA). Primers were designed
using the software PrimerExpress 1.5™ (PE Biosystems) and
sequence information found in dbSNP for NOD2/CARD15 SNPs 5,
8, 12, and 13. The MGB™ design adds a "minor groove binder"
to the 3' end of the TaqMan™ probes, thereby increasing the
binding temperature of the probe and enabling the use of
shorter probes than in conventional TaqMan™ assays (Kutyavin
et al., Nucleic Acids Res. 25:3718-3723 (1997)). This has
the effect of increasing the discrimination between the
alleles in the assay (Kutyavin et al., Nucleic Acids Res.
28:655-661 (2000)). Assays were performed following the
manufacturer's recommendations (PE Biosystems bulletin
4317594) in an ABI 7900 instrument. Genotyping was
performed blinded to clinical status of the subjects.
Primers and probes used in the genotyping assay are shown in
Tables 2 and 3.

Table 2

Primers Used in TaqMan MGB™ Assay for SNPs 5, 8, 12 and 13

SNP   Primer    Primer sequence                       SEQ ID NO
5     forward   5' GGTGGCTGGGCTCTTCT 3'               37
      reverse   5' CTCGCTTCCTCAGTACCTATGATG 3'        38
8     forward   5' CTGGCTGAGTGCCAGACATCT 3'           39
      reverse   5' GGCGGGATGGAGTGGAA 3'               40
12    forward   5' CCACCTCAAGCTCTGGTGATC 3'           41
      reverse   5' GTTGACTCTTTTGGCCTTTTCAG 3'         42
13    forward   5' CCTTACCAGACTTCCAGGATGGT 3'         43
      reverse   5' TGTCCAATAACTGCATCACCTACCT 3'       44

Table 3

TAQMAN PROBES

Allele detected                  Probe sequence                    SEQ ID NO
SNP5 wild type allele ("1")      6FAM-CATGGCTGGACCC-MGBNFQ         45
SNP5 variant allele ("2")        TET-CATGGCTGGATCC-MGBNFQ          46
SNP8 wild type allele ("1")      6FAM-TGCTCCGGCGCCA-MGBNFQ         47
SNP8 variant allele ("2")        TET-CTGCTCTGGCGCCA-MGBNFQ         48
SNP12 wild type allele ("1")     6FAM-CTCTGTTGCCCCAGAA-MGBNFQ      49
SNP12 variant allele ("2")       TET-CTCTGTTGCGCCAGA-MGBNFQ        50
SNP13 wild type allele ("1")     TET-CTTTCAAGGGCCTGC-MGBNFQ        51
SNP13 variant allele ("2")       6FAM-CCTTTCAAGGGGCCT-MGBNFQ       52
JW1 wild type allele             6FAM-AAGACTCGAGTGTCCT-MGBNFQ      53
JW1 variant                      VIC-AGACTCAAGTGTCCTC-MGBNFQ       54

As shown in Table 4, each of three rare allelic
variants of NOD2/CARD15 (a "2" allele at SNP 8, SNP 12, or
SNP 13) was significantly more frequent in patients with
Crohn's disease compared with ulcerative colitis. In
addition, as can be seen in Table 4, the frequency of each
of the NOD2/CARD15 rare allelic variants was similar in each
cohort of Crohn's disease patients, supporting the combined
use of the two cohorts. The overall frequency of any of the
three NOD2/CARD15 rare allelic variants was 35% in Crohn's
disease patients compared with 11% in ulcerative colitis
patients (p=0.001).
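
For illustration only, a comparison of carrier frequencies of
this kind can be sketched as a chi-square test on a 2x2 table.
The carrier counts below (71 of 201 Crohn's disease patients and
19 of 175 ulcerative colitis patients) are back-calculated from
the percentages and group sizes in Table 4 and are an assumption
of this sketch, as is the use of an uncorrected Pearson
statistic; the statistical software actually used may apply a
different test or correction.

```python
import math


def chi_square_2x2(a, b, c, d):
    """Pearson chi-square test (no continuity correction) for a 2x2 table.

    a, b: carriers and non-carriers in group 1
    c, d: carriers and non-carriers in group 2
    Returns (chi-square statistic, p-value for 1 degree of freedom).
    """
    n = a + b + c + d
    row1, row2, col1, col2 = a + b, c + d, a + c, b + d
    if 0 in (row1, row2, col1, col2):
        raise ValueError("every row and column total must be non-zero")
    chi2 = n * (a * d - b * c) ** 2 / (row1 * row2 * col1 * col2)
    # For 1 degree of freedom the upper tail equals erfc(sqrt(chi2 / 2)).
    p_value = math.erfc(math.sqrt(chi2 / 2))
    return chi2, p_value


# Carriage of any rare allelic variant: CD (71/201) versus UC (19/175).
chi2, p_value = chi_square_2x2(71, 201 - 71, 19, 175 - 19)
print(round(chi2, 1), p_value < 0.001)
```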

Within the combined Crohn's disease cohort, the
frequency of homozygotes with a "2" allele at SNP 13
(3020insC) and compound rare allelic heterozygotes was 1%
and 4%, respectively, while none of the ulcerative colitis
patients had such a genotype. These results demonstrate
that rare allelic variants of NOD2/CARD15 are associated
with Crohn's disease across diverse geographic and
ethnically-defined patient populations.



Table 4

Frequency of NOD2/CARD15 Rare Allelic Variants in CD and
UC Patient Populations

Allelic variants                  UC        CD1       CD2       Combined CD1   P (UC vs.
                                  (n=175)   (n=142)   (n=59)    and CD2        Combined CD)
                                                                (n=201)
R675W (SNP 8 "2" allele)          5.7%      16.9%     15.3%     16.4%          0.001
G881R (SNP 12 "2" allele)         1.7%      12.0%     10.2%     11.4%          0.0001
3020insC (SNP 13 "2" allele)      3.4%      11.3%     11.9%     11.4%          0.004
Carriage of any allelic variant   10.9%     36.6%     32.2%     35.3%          0.001

EXAMPLE III

RARE VARIANT ALLELES IN THE NOD2/CARD15 LOCUS ARE ASSOCIATED
WITH THE FIBROSTENOSING SUBTYPE OF CROHN'S DISEASE IN
COHORT 1

This example demonstrates that a "2" allele at
SNP 8, SNP 12, or SNP 13 is significantly associated with
fibrostenosing disease in Cohort 1.

Patients with Crohn's disease express diverse
clinical phenotypes that can be due to differences in
underlying genetic factors. In order to determine whether
rare variant alleles at the NOD2/CARD15 locus were
associated with specific Crohn's disease-related clinical
phenotypes or disease-related serum immune markers,
univariate analysis was performed. The univariate analysis
evaluated the association between NOD2/CARD15 allelic
variants at SNP 8, SNP 12, or SNP 13 and predefined clinical
characteristics, including age of onset, disease location,
and disease phenotype (fibrostenosing disease,
internal-perforating disease, perianal fistulizing disease
or ulcerative colitis-like disease). The association
between NOD2/CARD15 allelic variants and expression of the
serum immune markers ASCA and pANCA was also tested.

As shown in Table 5, univariate analysis indicated
that a "2" allele at SNP 8, SNP 12, or SNP 13 of the
NOD2/CARD15 locus was significantly associated with
fibrostenosing disease in Cohort 1 (p=0.049) for the three
allelic variants combined. A positive association at a less
stringent significance level (p<0.1) was also observed with
small bowel involvement and younger age of onset, and a
negative association was observed with UC-like disease in
this cohort. With respect to serologic markers, patients
with the "2" allele at SNP 13 were more likely to express
ASCA (p=0.053). These results demonstrate that a "2" allele
at SNP 8, SNP 12, or SNP 13 is significantly associated with
fibrostenosing disease in Cohort 1.



Statistical analysis was performed using SAS
computer software (Version 6.10; SAS Institute, Inc.; Cary,
NC). Quantitative variables were described as medians with
a range. Nonparametric statistical tests were used to test
differences in quantitative variables between two groups. A
Chi-square test or Fisher's exact test (when the expected
number was less than 5) was used to evaluate associations
between carriers and non-carriers of the rare alleles or
between genotypes and categorical variables, such as type of
IBD, disease location, disease behavior, and antibody
positivity. In addition, a Mantel-Haenszel stratified
association test was performed for all genotype and
phenotype associations by controlling for potential
confounding effect due to ethnic variation. This stratified
association test was also used to evaluate whether the small
bowel involvement and fibrostenosing disease were
independently associated with NOD2/CARD15 variants (see
Example IX below).
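
As a hedged illustration of the exact test named in the
statistical methods of this example, which is used when an
expected cell count is below 5, the following Python sketch
computes a two-sided Fisher's exact p-value for a 2x2 table. The
function name is hypothetical, and the "sum of tables no more
probable than the observed one" definition of the two-sided
p-value is an assumption of this sketch; statistical packages
may define it differently. The counts (16 of 113 versus 0 of 29
carriers) are back-calculated from the SNP 13 percentages for
small bowel involvement in Table 5.

```python
from math import comb


def fisher_exact_two_sided(a, b, c, d):
    """Two-sided Fisher's exact test for the 2x2 table [[a, b], [c, d]].

    The p-value is the total hypergeometric probability of all tables
    (with the same margins) that are no more probable than the observed one.
    """
    row1, row2, col1 = a + b, c + d, a + c
    total_tables = comb(row1 + row2, col1)

    def prob(k):
        # Probability that the top-left cell equals k given fixed margins.
        return comb(row1, k) * comb(row2, col1 - k) / total_tables

    k_min = max(0, col1 - row2)
    k_max = min(row1, col1)
    p_observed = prob(a)
    tolerance = 1 + 1e-9  # guards against floating-point ties
    p_value = sum(prob(k) for k in range(k_min, k_max + 1)
                  if prob(k) <= p_observed * tolerance)
    return min(1.0, p_value)


# SNP 13 carriers: 16 of 113 with small bowel involvement, 0 of 29 without.
p_value = fisher_exact_two_sided(16, 113 - 16, 0, 29)
print(round(p_value, 3))
```

With these counts the expected number of carriers among the 29
patients without small bowel involvement is about 3.3, which is
the situation in which the exact test is preferred over the
chi-square test.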

Table 5

Relationship of NOD2/CARD15 Rare Variant Alleles and Clinical
Phenotypes of Crohn's Disease in Cohort 1

Qualitative Trait: % NOD2 variant carriers

Clinical phenotypes             n     R675W     G881R      3020insC   Carriage of any
                                      (SNP 8)   (SNP 12)   (SNP 13)   allelic variant
Small bowel involvement   yes   113   19.47%    11.50%     14.16%     40.71%
                          no    29    6.90%     13.79%     0.00%      20.69%
                      p value         0.081     0.494      0.04       0.063
Perianal perforating      yes   51    11.76%    11.76%     13.73%     35.29%
                          no    91    19.78%    12.09%     9.89%      37.36%
                      p value         0.248     0.839      0.547      0.747
Internal perforating      yes   67    13.43%    16.42%     11.94%     37.31%
                          no    75    20.00%    8.00%      10.67%     36.00%
                      p value         0.346     0.178      [illegible] [illegible]
Fibrostenosing            yes   85    18.82%    14.12%     15.29%     43.53%
                          no    57    14.04%    8.77%      5.26%      26.32%
                      p value         0.389     0.458      0.084      0.049
UC-like                   yes   56    17.86%    10.71%     5.36%      30.36%
                          no    86    16.28%    12.79%     15.12%     40.70%
                      p value         0.822     0.736      0.076      0.22
pANCA positive            yes   28    17.86%    14.29%     7.14%      32.14%
                          no    114   16.67%    11.40%     12.28%     37.72%
                      p value         0.82      0.793      0.394      0.529
ASCA positive             yes   81    18.52%    9.88%      16.05%     38.27%
                          no    61    14.75%    14.75%     4.92%      34.43%
                      p value         0.467     0.234      0.053      0.744

Quantitative Trait: median (range)

Age of onset                          R675W      G881R      3020insC   Carriage of any
                                      (SNP 8)    (SNP 12)   (SNP 13)   allelic variant
carrier of NOD2 variant   yes         22 (6-67)  22 (4-62)  19 (10-50) 20 (4-67)
                          no          22 (4-63)  22 (4-67)  22 (4-67)  22 (4-63)
                          P           0.715      0.937      0.074      0.238

EXAMPLE IV

RARE VARIANT ALLELES IN THE NOD2/CARD15 LOCUS ARE ASSOCIATED
WITH THE FIBROSTENOSING SUBTYPE OF CROHN'S DISEASE IN
COHORT 2

This example demonstrates that a "2" allele at
SNP 8, SNP 12, or SNP 13 of the NOD2/CARD15 locus is
significantly associated with fibrostenosing disease in
Cohort 2.

The results obtained with Cohort 1 in Example III
indicated that NOD2/CARD15 rare variant alleles were
positively associated with fibrostenosing Crohn's disease,
small bowel involvement, ASCA positivity, and younger age of
onset, and negatively associated with UC-like disease (see
Table 5). These hypotheses were further tested using Cohort
2; the results are shown in Table 6. As with Cohort 1,
Cohort 2 demonstrated a significant association between a
"2" allele at SNP 8, SNP 12, or SNP 13 of the NOD2/CARD15
locus and fibrostenosing disease (p=0.002, with Bonferroni
correction p=0.01; see Table 6). These results indicate
that a "2" allele at SNP 8, SNP 12, or SNP 13 of the
NOD2/CARD15 locus is significantly associated with
fibrostenosing disease in Cohort 2.
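
The Bonferroni correction referred to above multiplies the
nominal p-value by the number of hypotheses tested. In the
following sketch the number of tests is taken to be five, on the
assumption that the five Cohort 1 associations carried forward
from Example III were the hypotheses tested; that count is
inferred from the reported values and is not stated explicitly.

```python
def bonferroni_adjust(p_value, n_tests):
    """Bonferroni-adjusted p-value, capped at 1.0."""
    if n_tests < 1:
        raise ValueError("n_tests must be at least 1")
    return min(1.0, p_value * n_tests)


# Nominal p-value for fibrostenosing disease in Cohort 2.
# The count of 5 tested hypotheses is an assumption (see lead-in).
adjusted = bonferroni_adjust(0.002, 5)
print(adjusted)
```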



Table 6

Relationship of NOD2/CARD15 Rare Variant Alleles and Clinical
Phenotypes of Crohn's Disease in Cohort 2

Qualitative Trait: % NOD2 variant carriers

Clinical                           R675W      G881R      3020insC   Carriage of any
phenotypes                   n     (SNP 8)    (SNP 12)   (SNP 13)   allelic variant

Fibrostenosing         yes   18    22.22%     22.22%     27.78%     61.11%
                       no    41    12.20%      4.88%      4.88%     19.51%
                       P            0.315      0.048      0.018      0.002

Small bowel            yes   42    19.05%      9.52%     14.29%     35.71%
involvement            no    17     5.88%     11.76%      5.88%     23.53%
                       P            0.22       0.828      0.288      0.354

ASCA positive          yes   22     9.09%     13.64%     13.64%     31.82%
                       no    35    17.14%      8.57%     11.43%     31.43%
                       P            0.4        0.542      0.735      0.956

Quantitative Trait: median (range)

                                   R675W      G881R      3020insC   Carriage of any
                                   (SNP 8)    (SNP 12)   (SNP 13)   allelic variant

Age of onset
carrier of NOD2        yes         27 (10-58) 26 (7-33)  17 (13-35) 22 (7-58)
variant                no          19 (2-55)  20 (2-58)  24 (2-58)  22 (2-55)
                       P           0.332      0.9        0.566      0.981
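
As a rough check on Table 6, the fibrostenosing row for carriage of any allelic variant can be recomputed from counts. The sketch below assumes the counts implied by the reported percentages (11 of 18 carriers with fibrostenosing disease, 8 of 41 without) and assumes a Pearson chi-square test without continuity correction; the text does not name the test that was used. The function name `chi_square_2x2` is illustrative.

```python
import math


def chi_square_2x2(a: int, b: int, c: int, d: int):
    """Pearson chi-square (1 degree of freedom, no continuity correction)
    for the 2x2 table [[a, b], [c, d]]. Returns (chi2, p_value)."""
    n = a + b + c + d
    row1, row2 = a + b, c + d
    col1, col2 = a + c, b + d
    if 0 in (row1, row2, col1, col2):
        raise ValueError("every row and column total must be non-zero")
    chi2 = n * (a * d - b * c) ** 2 / (row1 * row2 * col1 * col2)
    # Survival function of a chi-square variable with 1 degree of freedom.
    p_value = math.erfc(math.sqrt(chi2 / 2.0))
    return chi2, p_value


# Rows: fibrostenosing yes / no. Columns: carrier / non-carrier.
# Counts are inferred from 61.11% of n=18 and 19.51% of n=41.
chi2, p = chi_square_2x2(11, 7, 8, 33)
print(f"chi2 = {chi2:.2f}, p = {p:.4f}")
```

Under these assumptions the p-value rounds to the 0.002 shown in the table.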



EXAMPLE V

RARE VARIANT ALLELES IN THE NOD2/CARD15 LOCUS ARE ASSOCIATED
WITH THE FIBROSTENOSING SUBTYPE OF CROHN'S DISEASE IN A
COMBINED COHORT REPRESENTING COHORTS 1 AND 2

This example demonstrates that a "2" allele at
SNP 8, SNP 12, or SNP 13 is significantly associated with
fibrostenosing disease in a combined cohort representing
Cohorts 1 and 2.

As seen in Figure 2, the association between a "2"
allele at SNP 8, SNP 12, or SNP 13 of the NOD2/CARD15 locus
(NOD2 variant carrier) and fibrostenosing disease was even
more significant (p=0.001) when the two cohorts were
analyzed together than when analyzed separately.

The association between a "2" allele at SNP 8,
SNP 12, or SNP 13 of the NOD2/CARD15 locus (NOD2 variant
carrier) and fibrostenosing disease was observed in both
Jewish and non-Jewish individuals. Approximately 46% of
Crohn's disease patients with fibrostenosing disease (Jews
52% vs. non-Jews 42%) had at least one of these rare alleles
compared with only 23% (Jews 21.6% vs. non-Jews 25%) of
Crohn's disease patients without fibrostenosing disease
(odds ratio, 2.8; 95% confidence interval, 1.56-5.18). Of
the three rare variant alleles, the frameshift mutation
3020insC (a "2" allele at SNP 13) demonstrated the greatest
association with fibrostenosing disease (47% vs. 17%,
p=0.006 for cohorts combined). These results indicate that
rare variant alleles in the NOD2/CARD15 locus are associated
with the fibrostenosing subtype of Crohn's disease in a
combined cohort representing Cohorts 1 and 2.
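
For readers unfamiliar with how an odds ratio and its confidence interval are obtained from a 2x2 table, the following is a minimal sketch using the log-odds (Woolf) method. The counts are hypothetical: the text reports percentages (46% vs. 23%) but not the underlying counts, so 100 patients per group are assumed purely for illustration, and the resulting interval is not expected to match the reported one. The function name `odds_ratio_ci` is likewise illustrative.

```python
import math


def odds_ratio_ci(a: int, b: int, c: int, d: int, z: float = 1.96):
    """Odds ratio with a Woolf (log-odds) confidence interval.

    a = carriers with the phenotype,    b = non-carriers with the phenotype,
    c = carriers without the phenotype, d = non-carriers without the phenotype.
    z = 1.96 gives an approximate 95% interval.
    """
    if min(a, b, c, d) <= 0:
        raise ValueError("all four cell counts must be positive")
    odds_ratio = (a * d) / (b * c)
    se_log_or = math.sqrt(1 / a + 1 / b + 1 / c + 1 / d)
    lower = math.exp(math.log(odds_ratio) - z * se_log_or)
    upper = math.exp(math.log(odds_ratio) + z * se_log_or)
    return odds_ratio, lower, upper


# Hypothetical counts: 46 of 100 carriers with fibrostenosing disease,
# 23 of 100 carriers without it.
or_value, ci_low, ci_high = odds_ratio_ci(46, 54, 23, 77)
print(f"OR = {or_value:.2f}, 95% CI {ci_low:.2f}-{ci_high:.2f}")
```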
EXAMPLE VI

CROHN'S DISEASE PATIENTS WITH HOMOZYGOUS MUTATIONS OR
COMPOUND HETEROZYGOUS MUTATIONS IN NOD2/CARD15

This example describes the increased risk of
fibrostenosing disease in Crohn's disease patients carrying
homozygous mutations or compound heterozygous mutations in
the NOD2/CARD15 locus.

As shown in Figure 3, compared with patients who
were not carriers of NOD2/CARD15 mutations at SNP 8, SNP 12,
or SNP 13, patients who were carriers of two mutations in
NOD2/CARD15 were significantly more likely to have
fibrostenosing disease (85% vs. 43%; odds ratio 7.4; 95%
confidence interval 1.9-28.9; p=0.004). Patients who were
carriers of a single NOD2/CARD15 mutation were also
significantly more likely to have fibrostenosing disease
when compared with patients who were not carriers of these
NOD2/CARD15 mutations (64% vs. 43%; odds ratio 2.37; 95%
confidence interval 1.26-4.47; p=0.008). These results
indicate that Crohn's disease patients with homozygous
mutations or compound heterozygous mutations in NOD2/CARD15
have an increased risk of fibrostenosing disease.
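
The grouping of patients into non-carriers, carriers of a single mutation, and carriers of two mutations can be sketched as follows. The data layout (a dictionary from SNP name to a pair of alleles coded "1" or "2"), the key names, and the function name `carrier_status` are assumptions made for illustration and are not taken from the text.

```python
from typing import Dict, Tuple

# SNPs whose "2" allele is the rare variant discussed in the text.
RARE_VARIANT_SNPS = ("SNP8", "SNP12", "SNP13")

# Hypothetical layout: SNP name -> the two alleles observed ("1" or "2").
Genotype = Dict[str, Tuple[str, str]]


def carrier_status(genotype: Genotype) -> str:
    """Classify a patient by the number of "2" alleles at SNP 8, 12 and 13."""
    # Assumption: a SNP missing from the input is treated as "1"/"1".
    counts = {
        snp: genotype.get(snp, ("1", "1")).count("2")
        for snp in RARE_VARIANT_SNPS
    }
    total = sum(counts.values())
    if total == 0:
        return "non-carrier"
    if total == 1:
        return "single mutation"
    if any(n == 2 for n in counts.values()):
        return "homozygous"
    # "2" alleles at two or more different SNPs. Without phase information
    # this is only presumed to be compound heterozygosity.
    return "compound heterozygous"


print(carrier_status({"SNP8": ("1", "2"), "SNP13": ("2", "1")}))
```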

EXAMPLE VII

FIBROSTENOSING DISEASE ONLY COMPARED TO FIBROSTENOSING AND
PERFORATING DISEASE

This example describes the association of the "2"
allele at SNP 8, SNP 12, or SNP 13 of the NOD2/CARD15 locus
in patients with fibrostenosing disease compared to patients
with fibrostenosing and perforating disease.

Fibrostenosing and perforating disease often occur
in the same patient. Patients with fibrostenosing disease
can be characterized as i) having only fibrostenosing
disease or ii) having both fibrostenosing and perforating
disease. In order to address these phenotypes individually,
patients in each of the two cohorts were separated by the
presence of fibrostenosing disease with perforating
complications (Fib + perf) or without perforating
complications (Fib only) and compared with patients with
perforating complications without evidence of fibrostenosis
(Perf only). The percentage of patients having only
fibrostenosing disease that carried a "2" allele at SNP 8,
SNP 12, or SNP 13 of the NOD2/CARD15 locus was 48.3%, which
was similar to that seen in patients with both
fibrostenosing and perforating complications (46.0%; p=0.8).
As seen in Figure 3, when patients with fibrostenosing
disease were compared with those patients described as
having perforating disease only (perianal or internal), the
frequency of the "2" allele at SNP 8, SNP 12, or SNP 13 of
the NOD2/CARD15 locus in patients with fibrostenosing
disease (with or without perforating complications) was
significantly greater than that seen in patients with only
perforating complications (46.6% versus 18.6%; p=0.002).

EXAMPLE VIII

MULTIVARIATE ANALYSIS OF THE COMBINED PATIENT COHORT

This example demonstrates that fibrostenosing
disease is independently associated with a "2" allele at SNP
8, SNP 12, or SNP 13 of the NOD2/CARD15 locus.

For multivariate analysis, all variables with at
least borderline significance (p<0.1) in either cohort were
tested simultaneously for their association with a "2"
allele at SNP 8, SNP 12, or SNP 13 of the NOD2/CARD15 locus
by logistic regression. As shown in Table 7, the
clinical phenotype of fibrostenosing disease was
significantly associated (p<0.05) with these rare alleles
at the NOD2/CARD15 locus (OR 2.8; 95% CI, 1.3-6.0). These
results confirm that fibrostenosing disease is independently
associated with a "2" allele at SNP 8, SNP 12, or SNP 13 of
the NOD2/CARD15 locus.

Table 7

Multivariate Analysis in the Combined
Cohort for 5 Phenotypic Variables

Clinical phenotypes          OR     95% CI     P
Fibrostenosing disease       2.8    1.3-6.0    0.011
Small bowel involvement      1.3    0.5-3.4    0.561
UC-like                      0.9    0.4-1.7    0.658
ASCA positive                0.7    0.3-1.3    0.250
Age of onset                 1.0    0.9-1.0    0.874

EXAMPLE IX

FIBROSTENOSING DISEASE AND SMALL-BOWEL INVOLVEMENT

This example demonstrates that the association
between fibrostenosing disease and a "2" allele at SNP 8,
SNP 12, or SNP 13 of the NOD2/CARD15 locus is independent of
small-bowel involvement.

Because fibrostenosing disease is more likely to
occur in patients with small-bowel involvement, patients
were stratified on the basis of small-bowel involvement to
analyze whether the association between fibrostenosing
disease and NOD2/CARD15 variant alleles was a primary
association. Among patients with small-bowel involvement,
26.4% of patients who did not have fibrostenosing disease
(n=53) had a "2" allele at SNP 8, SNP 12, or SNP 13, whereas
46.1% of patients who had fibrostenosing disease (n=102) had
a "2" allele at SNP 8, SNP 12, or SNP 13 (p=0.017). A
similar trend was observed among patients without
small-bowel involvement (p=0.05), and the combined analysis
conditioning on small-bowel involvement yielded a
significance level of 0.009.

After controlling for fibrostenosing disease,
small-bowel involvement was not associated with a "2" allele
at SNP 8, SNP 12, or SNP 13 of the NOD2/CARD15 locus
(p=0.63). This result agrees with the results from logistic
regression analysis (see Example VIII) and indicates that
the association between fibrostenosing disease and a "2"
allele at SNP 8, SNP 12, or SNP 13 of the NOD2/CARD15 locus
is independent of small-bowel involvement. These results
further indicate that the observed small-bowel association
with a "2" allele at SNP 8, SNP 12, or SNP 13 of the
NOD2/CARD15 locus is secondary to the presence of
fibrostenosing disease.

All journal article, reference, and patent
citations provided above, in parentheses or otherwise,
whether previously stated or not, are incorporated herein by
reference.
Although the invention has been described with
reference to the examples above, it should be understood 
that various modifications can be made without departing 
from the spirit of the invention. Accordingly, the 
invention is limited only by the following claims. 
We claim:

1. A method of diagnosing or predicting
susceptibility to a clinical subtype of Crohn's disease
characterized by fibrostenosing disease, comprising

determining the presence or absence in an
individual of a fibrostenosis-predisposing allele linked to
a NOD2/CARD15 locus,

wherein the presence of said
fibrostenosis-predisposing allele is diagnostic of or
predictive of susceptibility to the clinical subtype of
Crohn's disease characterized by fibrostenosing disease.

2. The method of claim 1, wherein said clinical
subtype of Crohn's disease is characterized by
fibrostenosing disease independent of small bowel
involvement.

3. The method of claim 1, wherein said
fibrostenosis-predisposing allele is located within said
NOD2/CARD15 locus.

4. The method of claim 3, wherein NF-kappa B
activation by a NOD2/CARD15 polypeptide encoded by said
fibrostenosis-predisposing allele is reduced as compared to
NF-kappa B activation by a wild-type NOD2/CARD15
polypeptide.

5. The method of claim 3, wherein said
fibrostenosis-predisposing allele is located in a coding
region of said NOD2/CARD15 locus.



6. The method of claim 5, wherein said
fibrostenosis-predisposing allele is located in a region
encoding residues 744 to 1020 of NOD2/CARD15.

7. The method of claim 5, wherein said
fibrostenosis-predisposing allele is a "2" allele at a SNP
selected from SNP 8, SNP 12, and SNP 13.

8. The method of claim 7, wherein said
fibrostenosis-predisposing allele is a "2" allele at SNP 13.

9. The method of claim 3, wherein said
fibrostenosis-predisposing allele is located in a non-coding
region of said NOD2/CARD15 locus.

10. The method of claim 9, wherein said
fibrostenosis-predisposing allele is selected from a JW1,
JW15, and JW16 variant allele.

11. The method of claim 9, wherein said
fibrostenosis-predisposing allele is located in a promoter
region of said NOD2/CARD15 locus.

12. The method of claim 11, wherein said
fibrostenosis-predisposing allele is an allele selected from
a JW17 and JW18 variant allele.
13. The method of claim 1, comprising determining
the presence or absence in said individual of at least two
fibrostenosis-predisposing alleles linked to a NOD2/CARD15
locus,

wherein the presence of one or more of said
fibrostenosis-predisposing alleles is diagnostic of or
predictive of susceptibility to the clinical subtype of
Crohn's disease characterized by fibrostenosing disease.

14. The method of claim 13, wherein said at least
two fibrostenosis-predisposing alleles are "2" alleles at a
SNP selected from SNP 8, SNP 12, and SNP 13.

15. The method of claim 14, comprising
determining the presence or absence in said individual of

(i) a "2" allele at SNP 8,

(ii) a "2" allele at SNP 12, and

(iii) a "2" allele at SNP 13,

wherein the presence of one or more of said "2"
alleles at SNP 8, SNP 12, and SNP 13 is diagnostic of or
predictive of susceptibility to the clinical subtype of
Crohn's disease characterized by fibrostenosing disease.

16. The method of claim 1, wherein said
fibrostenosis-predisposing allele is associated with said
clinical subtype of Crohn's disease characterized by
fibrostenosing disease with an odds ratio of at least 2 and
a lower 95% confidence limit greater than 1.
17. The method of claim 1, further comprising
generating a report indicating the presence or absence in
said individual of said fibrostenosis-predisposing allele.

18. The method of claim 1, further comprising
generating a report indicating the presence or absence in
said individual of said clinical subtype of Crohn's disease
characterized by fibrostenosing disease.

19. The method of claim 1, wherein determining
the presence or absence of said fibrostenosis-predisposing
allele comprises enzymatic amplification of nucleic acid
from said individual.

20. The method of claim 19, wherein said
amplification is polymerase chain reaction amplification.

21. The method of claim 20, wherein said
polymerase chain reaction amplification is performed using
one or more fluorescently labeled probes.

22. The method of claim 20, wherein said
polymerase chain reaction amplification is performed using
one or more probes comprising a DNA minor groove binder.
23. A method of optimizing therapy in an
individual, comprising

(a) determining the presence or absence in said
individual of a fibrostenosis-predisposing allele linked to
a NOD2/CARD15 locus,

(b) diagnosing individuals in which said
fibrostenosis-predisposing allele is present as having a
fibrostenosing subtype of Crohn's disease, and

(c) treating said individual having a fibrostenosing
subtype of Crohn's disease based on said diagnosis.
ABSTRACT OF THE DISCLOSURE

METHODS OF USING A NOD2/CARD15 HAPLOTYPE TO
DIAGNOSE CROHN'S DISEASE

The present invention provides a method of
diagnosing or predicting susceptibility to a clinical
subtype of Crohn's disease characterized by fibrostenosing
disease by determining the presence or absence in an
individual of a fibrostenosis-predisposing allele linked to
a NOD2/CARD15 locus, where the presence of the
fibrostenosis-predisposing allele is diagnostic of or
predictive of susceptibility to the clinical subtype of
Crohn's disease characterized by fibrostenosing disease. In
a method of the invention, the clinical subtype of Crohn's
disease can be, for example, characterized by fibrostenosing
disease independent of small bowel involvement. The
invention also provides a method of optimizing therapy in an
individual by determining the presence or absence in the
individual of a fibrostenosis-predisposing allele linked to
a NOD2/CARD15 locus, diagnosing individuals in which the
fibrostenosis-predisposing allele is present as having a
fibrostenosing subtype of Crohn's disease, and treating the
individual having a fibrostenosing subtype of Crohn's
disease based on the diagnosis.
Figure 2

Figure 3
FIGURE 5
SNP 8

[Sequence figure: double-stranded 494-nucleotide sequence containing SNP 8. Upper strand, 5' to 3': SEQ ID NO 1. Lower strand, 3' to 5': SEQ ID NO 2. Both sequences appear in the Sequence Listing.]
FIGURE 6
SNP 12

[Sequence figure: double-stranded 540-nucleotide sequence containing SNP 12. Upper strand, 5' to 3': SEQ ID NO 3. Lower strand, 3' to 5': SEQ ID NO 4. Both sequences appear in the Sequence Listing.]
FIGURE 7
SNP 13

[Sequence figure: double-stranded 541-nucleotide sequence containing SNP 13. Upper strand, 5' to 3': SEQ ID NO 5. Lower strand, 3' to 5': SEQ ID NO 6. Both sequences appear in the Sequence Listing.]
FIGURE 8
SNP 5

[Sequence figure: double-stranded 540-nucleotide sequence containing SNP 5. Upper strand, 5' to 3': SEQ ID NO 7. Lower strand, 3' to 5': SEQ ID NO 8. Both sequences appear in the Sequence Listing.]
FIGURE 9
JW1

[Sequence figure: double-stranded 520-nucleotide sequence containing JW1. Upper strand, 5' to 3': SEQ ID NO 9. Lower strand, 3' to 5': SEQ ID NO 10. Both sequences appear in the Sequence Listing.]
SEQUENCE LISTING

<110> Abreu, Maria T.
Taylor, Kent D.
Rotter, Jerome I.
Yang, Huiying
Sugimura, Kazuhito
Targan, Stephan R.

<120> Mutations in NOD2 are Associated with
Fibrostenosing Disease in Patients with Crohn's Disease

<130> P-CE 5480

<150> US 60/407,391
<151> 2002-08-30

<160> 67 

<170> FastSEQ for Windows Version 4.0 

<210> 1 

<211> 494 

<212> DNA 

<213> Homo sapiens 

<400> 1 

accttcagat cacagcagcc ttcctggcag ggctgttgtc ccgggagcac tggggcctgc 60 
tggctgagtg ccagacatct gagaaggccc tgctccggcg ccaggcctgt gcccgctggt 120 
gtctggcccg cagcctccgc aagcacttcc actccatccc gccagctgca ccgggtgagg 180 
ccaagagcgt gcatgccatg cccgggttca tctggctcat ccggagcctg tacgagatgc 240 
aggaggagcg gctggctcgg aaggctgcac gtggcctgaa tgttgggcac ctcaagttga 300
cattttgcag tgtgggcccc actgagtgtg ctgccctggc ctttgtgctg cagcacctcc 360
ggcggcccgt ggccctgcag ctggactaca actctgtggg tgacattggc ctggagcagc 420
tgctgccttg ccttggtgtc tgcaaggctc tgtagtgagt gttactgggc attgctgttc 480
aggtatgggg gagc 494

<210> 2 

<211> 494 

<212> DNA 

<213> Homo sapiens 

<400> 2 

gctcccccat acctgaacag caatgcccag taacactcac tacagagcct tgcagacacc 60
aaggcaaggc agcagctgct ccaggccaat gtcacccaca gagttgtagt ccagctgcag 120
ggccacgggc cgccggaggt gctgcagcac aaaggccagg gcagcacact cagtggggcc 180
cacactgcaa aatgtcaact tgaggtgccc aacattcagg ccacgtgcag ccttccgagc 240
cagccgctcc tcctgcatct cgtacaggct ccggatgagc cagatgaacc cgggcatggc 300
atgcacgctc ttggcctcac ccggtgcagc tggcgggatg gagtggaagt gcttgcggag 360
gctgcgggcc agacaccagc gggcacaggc ctggcgccgg agcagggcct tctcagatgt 420
ctggcactca gccagcaggc cccagtgctc ccgggacaac agccctgcca ggaaggctgc 480
tgtgatctga aggt 494
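
The paired entries in this listing (for example SEQ ID NO 1 and SEQ ID NO 2) read as the two strands of the same region, each written 5' to 3'. A minimal sketch of how that relationship can be checked is given below. The function name `reverse_complement` is illustrative; the translation table also covers the IUPAC ambiguity codes that appear in later entries (m, s, y, n), and whitespace in the listing format is ignored.

```python
# Complement table for lower-case bases, including IUPAC ambiguity codes:
# m (a/c) <-> k (g/t), r (a/g) <-> y (c/t), s, w and n map to themselves.
COMPLEMENT = str.maketrans("acgtmkryswn", "tgcakmyrswn")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a nucleotide sequence.

    Spaces and line breaks (as used in the listing) are stripped first.
    """
    cleaned = "".join(seq.lower().split())
    return cleaned.translate(COMPLEMENT)[::-1]


# First 14 bases of SEQ ID NO 1 versus the last 14 bases of SEQ ID NO 2.
seq1_start = "accttcagat caca"
seq2_end = "tgtgatctga aggt"
print(reverse_complement(seq1_start) == "".join(seq2_end.split()))
```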

<210> 3 

<211> 540 

<212> DNA 

<213> Homo sapiens 



<400> 3 

atcaaaaccc tgagaggaca agggacattt ccaagtcacc cagaaagact cgagtgtcct 60
ctcttgaaat ccaatggtct tttttcctta ctccattgcc taacattgtg gggtagaaat 120
aaagttcaaa gaccttcaga actggcccca gctcctccct cttcacctga tctccccaag 180
aaaactgcag gatagactct gaagcttacc tgagccacct caagctctgg tgatcaccca 240
aggcttcagc cagggcctgg gccccctcgt cacccactct gttgccccag aatctgaaaa 300
ggccaaaaga gtcaacagac agtgtcagtg agtacctgat atgtgttcta gacatgaact 360
aacagtcctc ctccctctgc agtcccagcc agaggggcag gaccactcaa tcccagagtg 420
gcctcactgg ggctcctggt cccagcaaag tggacctgcc tccatctttt gggtgggatg 480
gccaaactta acccaagagt tttcagtggc tttacattac agacttagag aatagtagag 540



<210> 4 

<211> 540 

<212> DNA 

<213> Homo sapiens 

<400> 4 

ctctactatt ctctaagtct gtaatgtaaa gccactgaaa actcttgggt taagtttggc 60
catcccaccc aaaagatgga ggcaggtcca ctttgctggg accaggagcc ccagtgaggc 120
cactctggga ttgagtggtc ctgcccctct ggctgggact gcagagggag gaggactgtt 180
agttcatgtc tagaacacat atcaggtact cactgacact gtctgttgac tcttttggcc 240
ttttcagatt ctggggcaac agagtgggtg acgagggggc ccaggccctg gctgaagcct 300
tgggtgatca ccagagcttg aggtggctca ggtaagcttc agagtctatc ctgcagtttt 360
cttggggaga tcaggtgaag agggaggagc tggggccagt tctgaaggtc tttgaacttt 420
atttctaccc cacaatgtta ggcaatggag taaggaaaaa agaccattgg atttcaagag 480
aggacactcg agtctttctg ggtgacttgg aaatgtccct tgtcctctca gggttttgat 540



<210> 5 

<211> 541 

<212> DNA 

<213> Homo sapiens 



<400> 5 

tttaaaaatg aaatcattgc tccctactta aagaggtaaa gacttctttc ttagacagag 60
aatcagatcc ttcacatgca gaatcattct cactgaatgt cagaatcaga agggatcctc 120
aaaattctgc cattcctctc tcccgtcacc ccattttaca gatagaaaaa ctgaggttcg 180
gagagctaaa acaggcctgc ccaggggcct taccagactt ccaggatggt gtcattcctt 240
tcaaggggcc tgcaggaggg cttctgcccc taggtaggtg atgcagttat tggacaacct 300
ggaaaagaag atacaatggt gagcttcaag gattcttggt tttcctcttg aaactgtcca 360
gttaaagaga ctgcaggagt tagccagtct actgaagccc acctgtccct tagacacatc 420
ctgctcatgt ctgagattcc caatgagctc atcaacaaag gctcagtacc atcagtgaaa 480
tgtaaccgtc tctcttccat tcactagatg agtttatcaa attaagtagc cactccctta 540
g 541
<210> 6

<211> 541 

<212> DNA 

<213> Homo sapiens 

<400> 6 

ctaagggagt ggctacttaa tttgataaac tcatctagtg aatggaagag agacggttac 60 
atttcactga tggtactgag cctttgttga tgagctcatt gggaatctca gacatgagca 120 
ggatgtgtct aagggacagg tgggcttcag tagactggct aactcctgca gtctctttaa 180 
ctggacagtt tcaagaggaa aaccaagaat ccttgaagct caccattgta tcttcttttc 240 
caggttgtcc aataactgca tcacctacct aggggcagaa gccctcctgc aggccccttg 300 
aaaggaatga caccatcctg gaagtctggt aaggcccctg ggcaggcctg ttttagctct 360 
ccgaacctca gtttttctat ctgtaaaatg gggtgacggg agagaggaat ggcagaattt 420 
tgaggatccc ttctgattct gacattcagt gagaatgatt ctgcatgtga aggatctgat 480 
tctctgtcta agaaagaagt ctttacctct ttaagtaggg agcaatgatt tcatttttaa 540
a 541



<210> 7 

<211> 540 

<212> DNA 

<213> Homo sapiens 

<400> 7 

aacagcagtg ctcaaagagt agagtccgca cagagagtgg tttggccatg cactgcagct 60
gccggcagct gaatgggaag acaaagagaa attcctggaa gtcttgccct gcagcccaca 120
gcaagtgcag ccgctgcagg agcgtgctct tgccactgcc cgcctcaccc accaccagca 180
cagtgtccgc atcgtcattg aggtggccag gggtgctgaa gagctcctcc aggcccaggg 240
tggctgggct cttctgcggg ggtccagcca tgcccacatc tgcccagacc tccaggacat 300
tctctgtgta tatgtcctcc aggcagagcg tctctgctcc atcataggta ctgaggaagc 360
gagactgagc agacaccgtg gtcctcagct tggccatata cttcttgcat gtggcagctg 420
gaaggcagaa gaagaggcag atgaaggtgg caccatggtg aagacgggac ctaaccagac 480
aatgggctgc tgcgggggac gctgacataa ctgaagggat aggagagcca gcgggcgccc 540



<210> 8 

<211> 540 

<212> DNA 

<213> Homo sapiens 

<400> 8 

gggcgcccgc tggctctcct atcccttcag ttatgtcagc gtcccccgca gcagcccatt 60
gtctggttag gtcccgtctt caccatggtg ccaccttcat ctgcctcttc ttctgccttc 120
cagctgccac atgcaagaag tatatggcca agctgaggac cacggtgtct gctcagtctc 180
gcttcctcag tacctatgat ggagcagaga cgctctgcct ggaggacata tacacagaga 240
atgtcctgga ggtctgggca gatgtgggca tggctggacc cccgcagaag agcccagcca 300
ccctgggcct ggaggagctc ttcagcaccc ctggccacct caatgacgat gcggacactg 360
tgctggtggt gggtgaggcg ggcagtggca agagcacgct cctgcagcgg ctgcacttgc 420
tgtgggctgc agggcaagac ttccaggaat ttctctttgt cttcccattc agctgccggc 480
agctgcagtg catggccaaa ccactctctg tgcggactct actctttgag cactgctgtt 540



<210> 9
<211> 520
<212> DNA
<213> Homo sapiens

<400> 9
gcactgggca cccactacca atggattgga attggtcctt aagataaaat gtacctgatc 60 
cagcccaata tcttcaattt acagatactg tatcaaaacc ctgagaggac aagggacatt 120 
tccaagtcac ccagaaagac tcgagtgtcc tctcttgaaa tccaatggtc ttttttcctt 180 
actccattgc ctaacattgt ggggtagaaa taaagttcaa agaccttcag aactggcccc 240 
agctcctccc tcttcacctg atctccccaa gaaaactgca ggatagactc tgaagcttac 300 
ctgagccacc tcaagctctg gtgatcaccc aaggcttcag ccagggcctg ggccccctcg 360 
tcacccactc tgttgcccca gaatctgaaa aggccaaaag agtcaacaga cagtgtcagt 420 
gagtacctga tatgtgttct agacatgaac taacagtcct cctccctctg cagtcccagc 480 
cagaggggca ggaccactca atcccagagt ggcctcactg 520 



<210> 10 

<211> 520 

<212> DNA 

<213> Homo sapiens 



<400> 10 

cagtgaggcc actctgggat tgagtggtcc tgcccctctg gctgggactg cagagggagg 60 
aggactgtta gttcatgtct agaacacata tcaggtactc actgacactg tctgttgact 120 
cttttggcct tttcagattc tggggcaaca gagtgggtga cgagggggcc caggccctgg 180 
ctgaagcctt gggtgatcac cagagcttga ggtggctcag gtaagcttca gagtctatcc 240 
tgcagttttc ttggggagat caggtgaaga gggaggagct ggggccagtt ctgaaggtct 300 
ttgaacttta tttctacccc acaatgttag gcaatggagt aaggaaaaaa gaccattgga 360
tttcaagaga ggacactcga gtctttctgg gtgacttgga aatgtccctt gtcctctcag 420 
ggttttgata cagtatctgt aaattgaaga tattgggctg gatcaggtac attttatctt 480 
aaggaccaat tccaatccat tggtagtggg tgcccagtgc 520 

<210> 11 

<211> 535 

<212> DNA 

<213> Homo sapiens 

<400> 11
tcactaacca gctcaggaag ctcaccagct tgggaagtta atcattatgt ctagcttcag 60 
tttctcctgc ttcagtttaa attgggaaag agagagaaaa aatattcact cattatctgt 120 
ttcctaaaat tgtccttaac atccttcctc ttactccttt attacctggt cgggcttccc 180 
ctcttcaggc gaaatctgtc agtctatctg cattgccttt tgatctctac ttcagttact 240 
acaacttcaa agacaccatt gtcctcccca aggtgaggcc catgtagaga aaggatcact 300 
tccttgctga aagagagggt caaggggcga cccacgtggg ccctccctga aacccaggcc 360
caggcctgag cctggacacc tccttccttc ctgagaccac agccagcccg gtttctctgg 420 
ggccaagagc aaatgctttg cttaagtgct gaaatctcag cccactgacc ccttgcagac 480 
aggagaggag gggaagccca gggaagctca acttcccaag tgtcctgagt ctctg 535 

<210> 12 

<211> 496 

<212> DNA 

<213> Homo sapiens 



<400> 12 
aatcattatg tctagcttca gtttctcctg cttcagttta aattgggaaa gagagagaaa 60
aaatattcac tcattatctg tttcctaaaa ttgtccttaa catccttcct cttactcctt 120 
tattacctgg tcgggcttcc cctcttcagg cgaaatctgt cagtctatct gcattgcctt 180 
ttgatctcta cttcagttac tacmacttca aagacaccat tgtcctcccc aaggtgaggs 240 
ccatgtagag aaaggatcac ttccttgctg aaagagaggg tcaaggggtg acccacgtgg 300 
gccctccctg aaacccaggc ccaggcctga gcctggacac ctccttcctt cctgagacca 360
cagccagccc ggtttctctg gggccaagag caaatgcttt gcttaagtgc tgaaatctca 420 
gcccactgac cccttgcmga caggagagga ggggaagccc agggaagctc aacttcccaa 480 
gtgtcctgag tctctg 496 

<210> 13
<211> 488
<212> DNA
<213> Homo sapiens

<220>
<221> misc_feature
<222> 434
<223> n = A,T,C or G

<400> 13
tgtctagctt cagtttctcc tgcttcagtt taaattggga aagagagaga aaaaatattc 60
actcattatc tgtttcctaa aattgtcctt aacatccttc ctcttactcc tttattacct 120
ggtcgggctt cccctcttca ggcgaaatct gtcagtctat ctgcattgcc ttttgatctc 180
tacttcagtt actacaactt caaagacacc attgtcctcc ccaaggtgag gcccatgtag 240
agaaaggatc acttccttgc tgaaagagag ggtcaagggg tgacccacgt gggccctccc 300
tgaaacccag gcccaggcct gagcctggac acctccttcc ttcctgagac cacagccagc 360
ccggtttctc tggggccaag agcaaatgct ttgcttaagt gctgaaatct cagcccactg 420
amcccttgca gacnggagag gaggggaagc ccagggaagc tcaacttccc aagtgtcctg 480
agtctctg 488

<210> 14
<211> 491
<212> DNA
<213> Homo sapiens

<220>
<221> misc_feature
<222> 437
<223> n = A,T,C or G

<400> 14
ttatgtctag cttcagtttc tcctgcttca gtttaaattg ggaaagagag agaaaaaata 60
ttcactcatt atctgtttcc taaaattgtc cttaacatcc ttcctcttac tcctttatta 120
cctggtcggg cttcccctct tcaggcgaaa tctgtcagtc tatctgcatt gccttttgat 180
ctctacttca gttactacaa cttcaaagac accattgtcc tccccaaggt gaggcccatg 240
tagagaaagg atcacttcct tgctgaaaga gagggtcaag gggygaccca cgtgggccct 300
ccctgaaacc caggcccagg cctgagcctg gacacctcct tccttcctga gaccacagcc 360
agcccggttt ctctggggcc aagagcaaat gctttgctta agtgctgaaa tctcagccca 420
ctgacccctt gcagacngga gaggagggga agcccaggga agctcaactt cccaagtgtc 480
ctgagtctct g 491



<210> 15
<211> 491
<212> DNA
<213> Homo sapiens

<400> 15

ttatgtctag cttcagtttc tcctgcttca gtttaaattg ggaaagagag agaaaaaata 60
ttcactcatt atctgtttcc taaaattgtc cttaacatcc ttcctcttac tcctttatta 120
cctggtcggg cttcccctct tcaggcgaaa tctgtcagtc tatctgcatt gccttttgat 180
ctctacttca gttactacaa cttcaaagac accattgtcc tccccaaggt gaggcccatg 240
tagagaaagg atcacttcct tgctgaaaga gagggtcaag gggygaccca cgtgggccct 300
ccctgaaacc caggcccagg cctgagcctg gacacctcct tccttcctga gaccacagcc 360
agcccggttt ctctggggcc aagagcaaat gctttgctta agtgctgaaa tctcagccca 420
ctgacccctt gcagacagga gaggagggga agcccaggga agctcaactt cccaagtgtc 480
ctgagtctct g 491

<210> 16
<211> 491
<212> DNA
<213> Homo sapiens



<220> 

<221> misc_feature 
<222> 437 

<223> n = A,T,C or G 



<400> 16 

ttatgtctag cttcagtttc tcctgcttca gtttaaattg ggaaagagag agaaaaaata 60
ttcactcatt atctgtttcc taaaattgtc cttaacatcc ttcctcttac tcctttatta 120
cctggtcggg cttcccctct tcaggcgaaa tctgtcagtc tatctgcatt gccttttgat 180
ctctacttca gttactacaa cttcaaagac accattgtcc tccccaaggt gaggcccatg 240
tagagaaagg atcacttcct tgctgaaaga gagggtcaag gggygaccca cgtgggccct 300
ccctgaaacc caggcccagg cctgagcctg gacacctcct tccttcctga gaccacagcc 360
agcccggttt ctctggggcc aagagcaaat gctttgctta agtgctgaaa tctcagccca 420
ctgacccctt gcagacngga gaggagggga agcccaggga agctcaactt cccaagtgtc 480
ctgagtctct g 491



<210> 17 

<211> 491 

<212> DNA 

<213> Homo sapiens 



<220> 

<221> misc_feature
<222> 159

<223> n = A,T,C or G



<400> 17 

ttatgtctag cttcagtttc tcctgcttca gtttaaattg ggaaagagag agaaaaaata 60
ttcactcatt atctgtttcc taaaattgtc cttaacatcc ttcctcttac tcctttatta 120
cctggtcggg cttcccctct tcaggcgaaa tctgtcagnc tatctgcatt gccttttgat 180
ctctacttca gttactacaa cttcaaagac accattgtcc tccccaaggt gaggcccatg 240
tagagaaagg atcacttcct tgctgaaaga gagggtcaag gggygaccca cgtgggccct 300
ccctgaaacc caggcccagg cctgagcctg gacacctcct tccttcctga gaccacagcc 360
agcccggttt ctctggggcc aagagcaaat gctttgctta agtgctgaaa tctcagccca 420
ctgacccctt gcagacagga gaggagggga agcccaggga agctcaactt cccaagtgtc 480
ctgagtctct g 491



<210> 18 

<211> 487 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 
<222> 433 

<223> n = A,T,C or G 



<400> 18 

gtctagcttc agtttctcct gcttcagttt aaattgggaa agagagagaa aaaatattca 60
ctyattatct gtttcctaaa attgtcctta acatccttcc tcttactcct ttattacctg 120
gtcgggcttc ccctcttcag gcgaaatctg tcagtctatc tgcattgcct tttgatctct 180
acttcagtta ctacaacttc aaagacacca ttgtcctccc caaggtgagg cccatgtaga 240
gaaaggatca cttccttgct gaaagagagg gtcaaggggy gacccacgtg ggccctccct 300
gaaacccagg cccaggcctg agcctggaca cctccttcct tcctgagacc acagccagcc 360
cggtttctct ggggccaaga gcaaatgctt tgcttaagtg ctgaaatctc agcccactga 420
ccccttgcag acnggagagg aggggaagcc cagggaagct caacttccca agtgtcctga 480
gtctctg 487

<210> 19
<211> 486
<212> DNA
<213> Homo sapiens



<220> 

<221> misc_feature 
<222> 432 

<223> n = A,T,C or G 



<400> 19 

tctagcttca gtttctcctg cttcagttta aattgggaaa gagagagaaa aaatattcac 60
tcattatctg tttcctaaaa ttgtccttaa catccttcct cttactcctt tattacctgg 120
tcgggcttcc cctcttcagg cgaaatctgt cagtctatct gcattgcctt ttgatctcta 180
cttcagttac tacaacttca aagacaccat tgtcctcccc aaggtgaggc ccatgtagag 240
aaaggatcac ttccttgctg aaagagaggg tcaaggggcg acccacgtgg gccctccctg 300
aaacccaggc ccaggcctga gcctggacac ctccttcctt cctgagacca cagccagccc 360
ggtttctctg gggccaagag caaatgcttt gcttaagtgc tgaaatctca gcccactgac 420
cccttgcaga cnggagagga ggggaagccc agggaagctc aacttcccaa gtgtcctgag 480
tctctg 486

<210> 20
<211> 484
<212> DNA
<213> Homo sapiens



<220>
<221> misc_feature
<222> 430
<223> n = A,T,C or G

<400> 20

tagcttcagt ttctcctgct tcagtttaaa ttgggaaaga gagagaaaaa atattcactc 60
attatctgtt tcctaaaatt gtccttaaca tccttcctct tactccttta ttacctggtc 120
gggcttcccc tcttcaggcg aaatctgtca gtctatctgc attgcctttt gatctctact 180
tcagttacta caacttcaaa gacaccattg tcctccccaa ggtgaggccc atgtagagaa 240
aggatcactt ccttgctgaa agagagggtc aaggggcgac ccacgtgggc cctccctgaa 300
acccaggccc aggcctgagc ctggacacct ccttccttcc tgagaccaca gccagcccgg 360
tttctctggg gccaagagca aatgctttgc ttaagtgctg aaatctcagc ccactgaccc 420
cttgcagacn ggagaggagg ggaagcccag ggaagctcaa cttcccaagt gtcctgagtc 480
tctg 484



<210> 21 

<211> 485 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 
<222> 431 

<223> n = A,T,C or G 
<400> 21 

ctagcttcag tttctcctgc ttcagtttaa attgggaaag agagagaaaa aatattcact 60
yattatctgt ttcctaaaat tgtccttaac atccttcctc ttactccttt attacctggt 120
cgggcttccc ctcttcaggc gaaatctgtc agtctatctg cattgccttt tgatctctac 180
ttcagttact acaacttcaa agacaccatt gtcctcccca aggtgaggcc catgtagaga 240
aaggatcact tccttgctga aagagagggt caaggggcga cccacgtggg ccctccctga 300
aacccaggcc caggcctgag cctggacacc tccttccttc ctgagaccac agccagcccg 360
gtttctctgg ggccaagagc aaatgctttg cttaagtgct gaaatctcag cccactgacc 420
ccttgcagac nggagaggag gggaagccca gggaagctca acttcccaag tgtcctgagt 480
ctctg 485



<210> 22 

<211> 488 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 
<222> 434 

<223> n = A,T,C or G 
<400> 22 

tgtctagctt cagtttctcc tgcttcagtt taaattggga aagagagaga aaaaatattc 60
acttattatc tgtttcctaa aattgtcctt aacatccttc ctcttactcc tttattacct 120
ggtcgggctt cccctcttca ggcgaaatct gtcagtctat ctgcattgcc ttttgatctc 180
tacttcagtt actacaactt caaagacacc attgtcctcc ccaaggtgag gcccatgtag 240
agaaaggatc acttccttgc tgaaagagag ggtcaagggg cgacccacgt gggccctccc 300
tgaaacccag gcccaggcct gagcctggac acctccttcc ttcctgagac cacagccagc 360
ccggtttctc tggggccaag agcaaatgct ttgcttaagt gctgaaatct cagcccactg 420
accccttgca gacnggagag gaggggaagc ccagggaagc tcaacttccc aagtgtcctg 480
agtctctg 488

<210> 23
<211> 488
<212> DNA
<213> Homo sapiens
<220>
<221> misc_feature
<222> 434
<223> n = A,T,C or G
<400> 23

tgtctagctt cagtttctcc tgcttcagtt taaattggga aagagagaga aaaaatattc 60
acttattatc tgtttcctaa aattgtcctt aacatccttc ctcttactcc tttattacct 120
ggtcgggctt cccctcttca ggcgaaatct gtcagtctat ctgcattgcc ttttgatctc 180
tacttcagtt actacaactt caaagacacc attgtcctcc ccaaggtgag gcccatgtag 240
agaaaggatc acttccttgc tgaaagagag ggtcaagggg cgacccacgt gggccctccc 300
tgaaacccag gcccaggcct gagcctggac acctccttcc ttcctgagac cacagccagc 360
ccggtttctc tggggccaag agcaaatgct ttgcttaagt gctgaaatct cagcccactg 420
accccttgca gacnggagag gaggggaagc ccagggaagc tcaacttccc aagtgtcctg 480
agtctctg 488

<210> 24
<211> 497
<212> DNA
<213> Homo sapiens

<400> 24

tcactaggct tctggttgat gcctgtgaac tgaactctga caacagactt ctgaaataga 60
cccacaagag gcagttccat ttcatttgtg ccagaatgct ttaggatgta cagttatgga 120
ttgaaagttt acaggaaaaa aaattaggcc gttccttcaa agcaaatgtc ttcctggatt 180
attcaaaatg atgtatgttg aagcctttgt aaattgtcag atgctgtgca aatgttatta 240
ttttaaacat tatgatgtgt gaaaactggt taatatttat aggtcacttt gttttactgt 300
cttaagttta tactcttata gacaacatgg ccgtgaactt tatgctgtaa ataatcagag 360
gggaataaac tgttgagtca aaacagccat cttccttgtg accaaacatt taaaaatatt 420
ctggctgggc acagtggctc acgcctgtaa tcccagcact ttgggaggcc gaggtgggca 480
gatcacctga ggttggg 497

<210> 25
<211> 460
<212> DNA
<213> Homo sapiens

<400> 25

tgacaacaga cttctgaaat agacccacaa gaggcagttc catttcattt gtgccagaat 60
gctttaggat gtacagttat ggattgaaag tttacaggaa aaaaaattag gccgttcctt 120
caaagcaaat gtcttcctgg attattcaaa atgatgtatg ttgaagcctt tgtaaattgt 180
cagatgctgt gcaaatgtta ttattttaaa cattatgatg tgtgaaaact ggttaatatt 240
tataggtcac tttgttttac tgtcttaagt ttatactctt atagacaaca tggccgtgaa 300
ctttatgctg taaataatca gaggggaata aactgttgag tcaaaacagc catcttcctt 360
gtgaccaaac atttaaaaat attctggctg ggcacagtgg ctcacgcctg taatcccagc 420
actttgggag gccgaggtgg gcagatcacc tgaggttggg 460 

<210> 26 

<211> 462 

<212> DNA 

<213> Homo sapiens 

<400> 26 

tctgacaaca gacttctgaa atagacccac aagaggcagt tccatttcat ttgtgccaga 60 
atgctttagg atgtacagtt atggattgaa agtttacagg aaaaaaaatt aggccgttcc 120 
ttcaaagcaa atgtcttcct ggattattca aaatgatgta tgttgaagcc tttgtaaatt 180 
gtcagatgct gtgcaaatgt tattatttta aacattatga tgtgtgaaaa ctggttaata 240 
tttataggtc actttgtttt actgtcttaa gtttatactc ttatagacaa catggccgtg 300 
aactttatgc tgtaaataat cagaggggaa taaactgttg agtcaaaaca gccatcttcc 360 
ttgtgaccaa acatttaaaa atattctggc tgggcacagt ggctcacgcc tgtaatccca 420 
gcactttggg aggccgaggt gggcagatca cctgaggttg gg 462 

<210> 27 

<211> 459 

<212> DNA 

<213> Homo sapiens 



<400> 27 

gacaacagac ttctgaaata gacccacaag aggcagttcc atttcatttg tgccagaatg 60
ctttaggatg tacagttatg gattgaaagt ttacaggaaa aaaaattagg ccgttccttc 120
aaagcaaatg tcttcctgga ttattcaaaa tgatgtatgt tgaagccttt gtaaattgtc 180
agatgctgtg caaatgttat tattttaaac attatgatgt gtgaaaactg gttaatattt 240
ataggtcact ttgttttact gtcttaagtt tatactctta tagacaacat ggccgtgaac 300
tttatgctgt aaataatcag aggggaataa actgttgagt caaaacagcc atcttccttg 360
tgaccaaaca tttaaaaata ttctggctgg gcacagtggc tcacgcctgt aatcccagca 420
ctttgggagg ccgaggtggg cagatcacct gaggttggg 459

<210> 28
<211> 467
<212> DNA
<213> Homo sapiens



<400> 28 

tgaactctga caacagactt ctgaaataga cccacaagag gcagttccat ttcatttgtg 60 
ccagaatgct ttaggatgta cagttatgga ttgaaagttt acaggaaaaa aaattaggcc 120 
gttccttcaa agcaaatgtc ttcctggatt attcaaaatg atgtatgttg aagcctttgt 180 
aaattgtcag atgctgtgca aatgttatta ttttaaacat tatgatgtgt gaaaactggt 240 
taatatttat aggtcacttt gttttactgt cttaagttta tactcttata gacaacatgg 300 
ccgtgaactt tatgctgtaa ataatcagag gggaataaac tgttgagtca aaacagccat 360 
cttccttgtg accaaacatt taaaaatatt ctggctgggc acagtggctc acgcctgtaa 420 
tcccagcact ttgggaggcc gaggtgggca gatcacctga ggttggg 467 


<210> 29 

<211> 467 

<212> DNA 

<213> Homo sapiens 



<400> 29 






tgaactctga caacagactt ctgaaataga cccacaagag gcagttccat ttcatttgtg 60 
ccagaatgct ttaggatgta cagttatgga ttgaaagttt acaggaaaaa aaattaggcc 120 
gttccttcaa agcaaatgtc ttcctggatt attcaaaatg atgtatgttg aagcctttgt 180 
aaattgtcag atgctgtgca aatgttatta ttttaaacat tatgatgtgt gaaaactggt 240 
taatatttat agrtcacttt gttttactgt cttaagttta tactcttata gacaacatgg 300 
ccgtgaactt tatgctgtaa ataatcagag gggaataaac tgttgagtca aaacagccat 360 
cttccttgtg accaaacatt taaaaatatt ctggctgggc acagtggctc acgcctgtaa 420 
tcccagcact ttgggaggcc gaggtgggca gatcacctga ggttggg 467 



<210> 30 

<211> 466 

<212> DNA 

<213> Homo sapiens 



<400> 30 

gaactatgac aacagacttc tgaaatagac ccacaagagg cagttccatt tcatttgtgc 60
cagaatgctt taggatgtac agttatggat tgaaagttta caggaaaaaa aattaggccg 120
ttccttcaaa gcaaatgtct tcctggatta ttcaaaatga tgtatgttga agcctttgta 180
aattgtcaga tgctgtgcaa atgttattat tttaaacatt atgatgtgtg aaaactggtt 240
aatatttata grtcactttg ttttactgtc ttaagtttat actcttatag acaacatggc 300
cgtgaacttt atgctgtaaa taatcagagg ggaataaact gttgagtcaa aacagccatc 360
ttccttgtga ccaaacattt aaaaatattc tggctgggca cagtggctca cgcctgtaat 420
cccagcactt tgggaggccg aggtgggcag atcacctgag gttggg 466



<210> 31 

<211> 466 

<212> DNA 

<213> Homo sapiens 



<400> 31 

gaactctgac aacagacttc tgaaatagac ccacaagagg cagttccatt tcatttgtgc 60
cagaatgctt taggatgtac agttatggat tgaaagttta caggaaaaaa aattaggccg 120
ttccttcaaa gcaaatgtct tcctggatta ttcaaaatga tgtatgttga agcctttgta 180
aattgtcaga tgctgtgcaa atgttattat tttaaacatt atgatgtgtg aaaactggtt 240
aatatttata grtcactttg ttttactgtc ttaagtttat actcttatag acaacatggc 300
cgtgaacttt atgctgtaaa taatcagagg ggaataaact gttgagtcaa aacagccatc 360
ttccttgtga ccaaacattt aaaaatattc tggctgggca cagtggctca cgcctgtaat 420
cccagcactt tgggaggccg aggtgggcag atcacctgag gttggg 466

<210> 32
<211> 460
<212> DNA
<213> Homo sapiens



<400> 32 

tgacaacaga cttctgaaat agacccacaa gaggcagttc catttcattt gtgccagaat 60
gctttaggat gtacagttat ggattgaaag tttacaggaa aaaaaattag gccgttcctt 120
caaagcaaat gtcttcctgg attattcaaa atgatgtatg ttgaagcctt tgtaaattgt 180
cagatgctgt gcaaatgtta ttattttaaa cattatgatg tgtgaaaact ggttaatatt 240
tatagrtcac tttgttttac tgtcttaagt ttatactctt atagacaaca tggccgtgaa 300
ctttatgctg taaataatca gaggggaata aactgttgag tcaaaacagc catcttcctt 360
gtgaccaaac atttaaaaat attctggctg ggcacagtgg ctcacgcctg taatcccagc 420
actttgggag gccgaggtgg gcagatcacc tgaggttggg 460







<210> 33 
<211> 467 
<212> DNA 

<213> Homo sapiens 



<400> 33 

tgaactctga caacagactt ctgaaataga cccacaagag gcagttccat ttcatttgtg 60 
ccagaatgct ttaggatgta cagttatgga ttgaaagttt acaggaaaaa aaattaggcc 120 
gttccttcaa agcaaatgtc ttcctggatt attcaaaatg atgtatgttg aagcctttgt 180 
aaattgtcag atgctgtgca aatgttatta ttttaaacat tatgatgtgt gaaaactggt 240 
taatatttat aggtcacttt gttttactgt cttaagttta tactcttata gacaacatgg 300 
ccgtgaactt tatgctgtaa ataatcagag gggaataaac tgttgagtca aaacagccat 360 
cttccttgtg accaaacatt taaaaatatt ctggctgggc acagtggctc acgcctgtaa 420 
tcccagcact ttgggaggcc gaggtgggca gatcacctga ggttggg 467 



<210> 34 
<211> 460 
<212> DNA 

<213> Homo sapiens 



<400> 34 

tgacaacaga cttctgaaat agacccacaa gaggcagttc catttcattt gtgccagaat 60
gctttaggat gtacagttat ggattgaaag tttacaggaa aaaaaattag gccgttcctt 120
caaagcaaat gtcttcctgg attattcaaa atgatgtatg ttgaagcctt tgtaaattgt 180
cagatgctgt gcaaatgtta ttattttaaa cattatgatg tgtgaaaact ggttaatatt 240
tatagatcac tttgttttac tgtcttaagt ttatactctt atagacaaca tggccgtgaa 300
ctttatgctg taaataatca gaggggaata aactgttgag tcaaaacagc catcttcctt 360
gtgaccaaac atttaaaaat attctggctg ggcacagtgg ctcacgcctg taatcccagc 420
actttgggag gccgaggtgg gcagatcacc tgaggttggg 460

<210> 35
<211> 462
<212> DNA
<213> Homo sapiens



<400> 35 

tctgacaaca gacttctgaa atagacccac aagaggcagt tccatttcat ttgtgccaga 60
atgctttagg atgtacagtt atggattgaa agtttacagg aaaaaaaatt aggccgttcc 120
ttcaaagcaa atgtcttcct ggattattca aaatgatgta tgttgaagcc tttgtaaatt 180
gtcagatgct gtgcaaatgt tattatttta aacattatga tgtgtgaaaa ctggttaata 240
tttatagatc actttgtttt actgtcttaa gtttatactc ttatagacaa catggccgtg 300
aactttatgc tgtaaataat cagaggggaa taaactgttg agtcaaaaca gccatcttcc 360
ttgtgaccaa acatttaaaa atattctggc tgggcacagt ggctcacgcc tgtaatccca 420
gcactttggg aggccgaggt gggcagatca cctgaggttg gg 462

<210> 36
<211> 463
<212> DNA
<213> Homo sapiens



<400> 36 

ctctgacaac agacttctga aatagaccca caagaggcag ttccatttca tttgtgccag 




agcactttgg gaggccgagg tgggcagatc acctgaggtt ggg 

<210> 37 
<211> 17 
<212> DNA 

<213> Homo sapiens 
<400> 37 

ggtggctggg ctcttct 

<210> 38 
<211> 24 
<212> DNA 

<213> Homo sapiens 
<400> 38 

ctcgcttcct cagtacctat gatg 

<210> 39 

<211> 21 

<212> DNA 

<213> Homo sapiens 

<400> 39 

ctggctgagt gccagacatc t 

<210> 40 

<211> 17 

<212> DNA 

<213> Homo sapiens 

<400> 40 

ggcgggatgg agtggaa 

<210> 41 

<211> 21 

<212> DNA 

<213> Homo sapiens 

<400> 41 

ccacctcaag ctctggtgat c 

<210> 42 

<211> 23 

<212> DNA 

<213> Homo sapiens 




<400> 42 

gttgactctt ttggcctttt cag 

<210> 43 
<211> 23 
<212> DNA 

<213> Homo sapiens 
<400> 43 

ccttaccaga cttccaggat ggt 

<210> 44 
<211> 25 
<212> DNA 

<213> Homo sapiens 
<400> 44 

tgtccaataa ctgcatcacc tacct 

<210> 45 
<211> 13 
<212> DNA 

<213> Homo sapiens 

<400> 45 
catggctgga ccc 

<210> 46 
<211> 13 
<212> DNA 

<213> Homo sapiens 

<400> 46 
catggctgga tcc 

<210> 47 
<211> 13 
<212> DNA 

<213> Homo sapiens 

<400> 47 
tgctccggcg cca 

<210> 48 
<211> 14 
<212> DNA 

<213> Homo sapiens 

<400> 48 
ctgctctggc gcca 



<210> 49
<211> 16
<212> DNA
<213> Homo sapiens

<400> 49

ctctgttgcc ccagaa 16

<210> 50
<211> 15
<212> DNA
<213> Homo sapiens

<400> 50

ctctgttgcg ccaga 15

<210> 51
<211> 15
<212> DNA
<213> Homo sapiens

<400> 51

ctttcaaggg cctgc 15

<210> 52
<211> 15
<212> DNA
<213> Homo sapiens

<400> 52

cctttcaagg ggcct 15

<210> 53
<211> 16
<212> DNA
<213> Homo sapiens

<400> 53

aagactcgag tgtcct 16

<210> 54
<211> 16
<212> DNA
<213> Homo sapiens

<400> 54

agactcaagt gtcctc 16



<210> 55 

<211> 533 

<212> DNA 

<213> Homo sapiens 



<400> 55 

ttcgtctcag tttgtttgtg agcaggctgt gagtttgggc cccagaggct gggtgacatg 60 



tgttggcagc ctcttcaaaa tgagccctgt cctgcctaag gctgaacttg ttttctggga 120 
acaccatagg tcacctttat tctggcagag gagggagcat cagtgccctc caggatagac 180 
ttttcccaag cctacttttg ccattgactt cttcccaaga ttcaatccca ggatgtacaa 240 
ggacagcccc tcctccatag tatgggactg gcctctgctg atcctcccag gcttccgtgt 300 
gggtcagtgg ggcccatgga tgtgcttgtt aactgagtgc cttttggtgg agaggcccgg 360 
cctctcacaa aagacccctt accactgctc tgatgaagag gagtacacag aacacataat 420 
tcaggaagca gctttcccca tgtctcgact catccatcca ggccattccc cgtctctggt 480 
tcctcccctc ctcctggact cctgcacacg ctccttcctc tgaggctgaa att 533 




<210> 56 

<211> 497 

<212> DNA 

<213> Homo sapiens 



<400> 56 

gggccccaga ggctgggtga catgtgttgg cagcctcttc aaaatgagcc ctgtcctgcc 60
taaggctgaa cttgttttct gggaacacca taggtcacct ttattctggc agaggaggga 120
gcatcagtgc cctccaggat agacttttcc caagcctact tttgccattg acttcttccc 180
aagattcaat cccaggatgt acaaggacag cccctcctcc atagtatggg actggcctct 240
gctgatcctc ccaggcttcc gtgtgggtca gtggggccca tggatgtgct tgttaactga 300
gtgccttttg gtggagaggc ccggcctctc acaaaagacc ccttaccact gctctgatga 360
agaggagtac acagaacaca taattcagga agcagctttc cccatgtctc gactcatcca 420
tccaggccat tccccgtctc tggttcctcc cctcctcctg gactcctgca cacgctcctt 480
cctctgaggc tgaaatt 497



<210> 57 

<211> 497 

<212> DNA 

<213> Homo sapiens 

<400> 57 

gggccccaga ggctgggtga catgtgttgg aagcctcttc aaaatgagcc ctgtcctgcc 60
taaggctgaa cttgttttct gggaacacca taggtcacct ttattctggc agaggaggga 120
gcatcagtgc cctccaggat agacttttcc caagcctact tttgccattg acttcttccc 180
aagattcaat cccaggatgt acaaggacag cccctcctcc atagtatggg actggcctct 240
gctgatcctc ccaggcttcc gtgtgggtca gtggggccca tggatgtgct tgttaactga 300
gtgccttttg gtggagaggc ccggcctctc acaaaagacc ccttaccact gctctgatga 360
agaggagtac acagaacaca taattcagga agcagctttc cccatgtctc gactcatcca 420
tccaggccat tccccgtctc tggttcctcc cctcctcctg gactcctgca cacgctcctt 480
cctctgaggc tgaaatt 497



<210> 58 

<211> 497 

<212> DNA 

<213> Homo sapiens 



<400> 58 

gggccccaga ggctgggtga catgtgttgg cagcctcttc aaaatgagcc ctgtcctgcc 60
taaggctgaa cttgttttct gggaacacca taggtcacct ttattctggc agaggaggga 120
gcatcagtgc cctccaggat agacttttcc caagcctact tttgccattg acttcttccc 180
aagattcaat cccaggatgt acaaggacag cccctcctcc atagtatggg actggcctct 240
gctgatcctc ccaggcttcc gtgtgggtca gtggggccca tggatgtgct tgttaactga 300
gtgccttttg gtggagaggc ccggcctctc acaaaagacc ccttaccact gctctgatga 360
agaggagtac acagaacaca taattcagga agcagctttc cccatgtctc gactcatcca 420
tccaggccat tccccgtctc tggttcctcc cctcctcctg gactcctgca cacgctcctt 480
cctctgaggg tgaaatt 497 

<210> 59
<211> 483
<212> DNA
<213> Homo sapiens

<400> 59

gggtgacatg tgttggcagc ctcttcaaaa tgagccctgt cctgcctaag gctgaacttg 60
ttttctggga acaccatagg tcacctttat tctggcagag gagggagcat cagtgccctc 120
caggatagac ttttcccaag cctacttttg ccattgactt cttcccaaga ttcaatccca 180
ggatgtacaa ggacagcccc tcctccatag tatgggactg gcctctgctg atcctcccag 240
gcttccgtgt gggtcagtgg ggcccatgga tgtgcttgtt aactgagtgc cttttggtgg 300
agaggcccgg cctctcacaa aagacccctt accactgctc tgatgaagag gagtacacag 360
aacacmtaat tcaggaagca gctttcccca tgtctcgact catccatcca ggccattccc 420
cgtctctggt tcctcccctc ctcctggact cctgcacacg ctccttcctc tgaggctgaa 480
att 483

<210> 60
<211> 500
<212> DNA
<213> Homo sapiens

<400> 60

tttgggcccc agaggctggg tgacatgtgt tggcagcctc ttcaaaatga gccctgtcct 60
gcctaaggct gaacttgttt tctgggaaca ccataggtca cctttattct ggcagaggag 120
ggagcatcag tgccctccag gatagacttt tcccaagcct acttttgcca ttgacttctt 180
cccaagattc aatcccagga tgtacaagga cagcccctcc tccatagtat gggactggcc 240
tctgctgatc ctcccaggct tccgtgtggg tcagtggggc ccatggatgt gcttgttaac 300
tgagtgcctt ttggtggaga ggcccggcct ctcacaaaag accccttmcc actgctctga 360
tgaagaggag tacacagaac acataattca ggaagcagct ttccccatgt ctcgactcat 420
ccatccaggc cattccccgt ctctggttcc tcccctcctc ctggactcct gcacacgctc 480
cttcctctga ggctgaaatt 500

<210> 61
<211> 499
<212> DNA
<213> Homo sapiens

<400> 61

ttgggcccca gaggctgggt gacatgtgtt ggcagcctct tcaaaatgag ccctgtcctg 60
cctaaggctg aacttgtttt ctgggaacac cataggtcac ctttattctg gcagaggagg 120
gagcatcagt gccctccagg atagactttt cccaagccta cttttgccat tgacttcttc 180
ccaagattca atcccaggat gtacaaggac agcccctcct ccatagtatg ggactggcct 240
ctgctgatcc tcccaggctt ccgtgtgggt cagtggggcc catggatgtg cttgttaact 300
gagtgccttt tggtggagag gcccggcctc tcacaaaaga ccccttmcca ctgctctgat 360
gaagaggagt acacagaaca cataattcag gaagcagctt tccccatgtc tcgactcatc 420
catccaggcc attccccgtc tctggttcct cccctcctcc tggactcctg cacacgctcc 480
ttcctctgag gctgaaatt 499



<210> 62 
<211> 498 
<212> DNA 
<213> Homo sapiens 

<400> 62 

tgggccccag aggctgggtg acatgtgttg gcagcctctt caaaatgagc cctgtcctgc 60 
ctaaggctga acttgttttc tgggaacacc ataggtcacc tttattctgg cagaggaggg 120 
agcatcagtg ccctccagga tagacttttc ccaagcctac ttttgccatt gacttcttcc 180 
caagattcaa tcccaggatg tacaaggaca gcccctcctc catagtatgg gactgggctc 240 
tgctgatcct cccaggcttc cgtgtgggtc agtggggccc atggatgtgc ttgttaactg 300 
agtgcctttt ggtggagagg cccggcctct cacaaaagac cccttmccac tgctctgatg 360 
aagaggagta cacagaacac ataattcagg aagcagcttt ccccatgtct cgactcatcc 420 
atccaggcca ttccccgtct ctggttcctc ccctcctcct ggactcctgc acacgctcct 480 
tcctctgagg ctgaaatt 498 

<210> 63
<211> 498
<212> DNA
<213> Homo sapiens

<400> 63

tgggccccag aggctgggtg acatgtgttg gcagcctctt caaaatgagc cctgtcctgc 60
ctaaggctga acttgttttc tgggaacacc ataggtcacc tttattctgg cagaggaggg 120
agcatcagtg ccctccagga tagacttttc ccaagcctac ttttgccatt gacttcttcc 180
caagattcaa tcccaggatg tacaaggaca gcccctcctc catagtatgg gactggcctc 240
tgctgatcct cccaggcttc cgtgtgggtc agtggggccc atggatgtgc ttgttaactg 300
agtgcctttt ggtggagagg cccggcctct cacaaaagac cccttmccac tgctctgatg 360
aagaggagta cacagaacac ataattcagg aagcagcttt ccccatgtct cgactcatcc 420
atccaggcca ttccccgtct ctggttcctc ccctcctcct ggactcctgc acacgctcct 480
tcctctgagg ctgaaatt 498

<210> 64
<211> 498
<212> DNA
<213> Homo sapiens

<400> 64

tgggccccag aggctgggtg acatgtgttg gcagcctctt caaaatgagc cctgtcctgc 60
ctaaggctga acttgttttc tgggaacacc ataggtcacc tttattctgg cagaggaggg 120
agcatcagtg ccctccagga tagacttttc ccaagcctac ttttgccatt gacttcttcc 180
caagattcaa tcccaggatg tacaaggaca gcccctcctc catagtatgg gactggcctc 240
tgctgatcct cccaggcttc cgtgtgggtc agtggggccc atggatgtgc ttgttaactg 300
agtgcctttt ggtggagagg cccggcctct cacaaaagac cccttaccac tgctctgatg 360
aagaggagta cacagaacac ataattcagg aagcagcttt ccccatgtct cgactcatcc 420
atccaggcca ttccccgtct ctggttcctc ccctcctcct ggactcctgc acacgctcct 480
tcctctgagg ctgaaatt 498

<210> 65
<211> 503
<212> DNA
<213> Homo sapiens



<400> 65 
gagtttgggc cccagaggct gggtgacatg tgttggcagc ctcttcaaaa tgagccctgt 60 
cctgcctaag gctgaacttg ttttctggga acaccatagg tcacctttat tctggcagag 120 
gagggagcat cagtgccctc caggatagac ttttcccaag cctacttttg ccattgactt 180 
cttcccaaga ttcaatccca ggatgtacaa ggacagcccc tcctccatag tatgggactg 240 
gcctctgctg atcctcccag gcttccgtgt gggtcagtgg ggcccatgga tgtgcttgtt 300 
aactgagtgc cttttggtgg agaggcccgg cctctcacaa aagacccctt cccactgctc 360 
tgatgaagag gagtacacag aacacataat tcaggaagca gctttcccca tgtctcgact 420 
catccatcca ggccattccc cgtctctggt tcctcccctc ctcctggact cctgcacacg 480 
ctccttcctc tgaggctgaa att 503 



<210> 66
<211> 453
<212> DNA
<213> Homo sapiens

<400> 66

tgagccctgt cctgcctaag gctgaacttg ttttctggga acaccatagg tcacctttat 60
tctggcagag gagggagcat cagtgccctc caggatagac ttttcccaag cctacttttg 120
ccattgactt cttcccaaga ttcaatccca ggatgtacaa ggacagcccc tcctccatag 180
tatgggactg gcctctgctg atcctcccag gcttccgtgt gggtcagtgg ggcccatgga 240
tgtgcttgtt aactgagtgc cttttggtgg agaggcccsg cctctcacaa aagacccctt 300
cccactgctc tgatgaagag gagtacacag aacacataat tcaggaagca gctttcccca 360
tgtctcgact catccatcca ggccattccc cgtctctggt tcctcccctc ctcctggact 420
cctgcacacg ctccttcctc tgaggctgaa att 453

<210> 67
<211> 500
<212> DNA
<213> Homo sapiens

<400> 67

tttgggcccc agaggctggg tgacatgtgt tggcagcctc ttcaaaatga gccctgtcct 60
gcctaaggct gaacttgttt tctgggaaca ccataggtca cctttattct ggcagaggag 120
ggagcatcag tgccctccag gatagacttt tcccaagcct acttttgcca ttgacttctt 180
cccaagattc aatcccagga tgtacaagga cagcccctcc tccatagtat gggactggcc 240
tctgctgatc ctcccaggct tccgtgtggg tcagtggggc ccatggatgt gcttgttaac 300
tgagtgcctt ttggtggaga ggcccggcct ctcacaaaag accccttmcc actgctctga 360
tgaagaggag tacacagaac acataattca ggaagcagct ttccccatgt ctcgactcat 420
ccatccaggc cattccccgt ctctggttcc tcccctcctc ctggactcct gcacacgctc 480
cttcctctga gggtgaaatt 500